Preface


This book is intended as a text for a one- or two-semester introduction to topology, at 
the senior or first-year graduate level. 

The subject of topology is of interest in its own right, and it also serves to lay the 
foundations for future study in analysis, in geometry, and in algebraic topology. There 
is no universal agreement among mathematicians as to what a first course in topology 
should include; there are many topics that are appropriate to such a course, and not all 
are equally relevant to these differing purposes. In the choice of material to be treated, 
I have tried to strike a balance among the various points of view. 


Prerequisites. There are no formal subject matter prerequisites for studying most of 
this book. I do not even assume the reader knows much set theory. Having said that, 
I must hasten to add that unless the reader has studied a bit of analysis or “rigorous
calculus,” much of the motivation for the concepts introduced in the first part of the
book will be missing. Things will go more smoothly if he or she already has had some 
experience with continuous functions, open and closed sets, metric spaces, and the 
like, although none of these is actually assumed. In Part II, we do assume familiarity 
with the elements of group theory. 

Most students in a topology course have, in my experience, some knowledge of
the foundations of mathematics. But the amount varies a great deal from one student 
to another. Therefore, I begin with a fairly thorough chapter on set theory and logic. It 
starts at an elementary level and works up to a level that might be described as
“semi-sophisticated.” It treats those topics (and only those) that will be needed later in the
book. Most students will already be familiar with the material of the first few sections, 
but many of them will find their expertise disappearing somewhere about the middle 
of the chapter. How much time and effort the instructor will need to spend on this
chapter will thus depend largely on the mathematical sophistication and experience of 
the students. Ability to do the exercises fairly readily (and correctly!) should serve as 
a reasonable criterion for determining whether the student’s mastery of set theory is 
sufficient for the student to begin the study of topology. 

Many students (and instructors!) would prefer to skip the foundational material 
of Chapter 1 and jump right in to the study of topology. One ignores the foundations, 
however, only at the risk of later confusion and error. What one can do is to treat 
initially only those sections that are needed at once, postponing the remainder until 
they are needed. The first seven sections (through countability) are needed throughout 
the book; I usually assign some of them as reading and lecture on the rest. Sections 9 
and 10, on the axiom of choice and well-ordering, are not needed until the discussion 
of compactness in Chapter 3. Section 11, on the maximum principle, can be postponed 
even longer; it is needed only for the Tychonoff theorem (Chapter 5) and the theorem 
on the fundamental group of a linear graph (Chapter 14). 

How the book is organized. This book can be used for a number of different courses. 
I have attempted to organize it as flexibly as possible, so as to enable the instructor to
follow his or her own preferences in the matter. 

Part I, consisting of the first eight chapters, is devoted to the subject commonly 
called general topology. The first four chapters deal with the body of material that, 
in my opinion, should be included in any introductory topology course worthy of the 
name. This may be considered the “irreducible core” of the subject, treating as it does 
set theory, topological spaces, connectedness, compactness (through compactness of 
finite products), and the countability and separation axioms (through the Urysohn 
metrization theorem). The remaining four chapters of Part I explore additional topics; 
they are essentially independent of one another, depending on only the core material 
of Chapters 1-4. The instructor may take them up in any order he or she chooses. 

Part II constitutes an introduction to the subject of Algebraic Topology. It depends 
on only the core material of Chapters 1-4. This part of the book treats with some
thoroughness the notions of fundamental group and covering space, along with their
many and varied applications. Some of the chapters of Part II are independent of one
another; the dependence among them is expressed in the following diagram:


[Diagram: dependence among the chapters of Part II]

Chapter 9   The Fundamental Group
Chapter 10  Separation Theorems in the Plane
Chapter 11  The Seifert-van Kampen Theorem
Chapter 12  Classification of Surfaces
Chapter 13  Classification of Covering Spaces
Chapter 14  Applications to Group Theory

Certain sections of the book are marked with an asterisk; these sections may be
omitted or postponed with no loss of continuity. Certain theorems are marked similarly.
Any dependence of later material on these asterisked sections or theorems is
indicated at the time, and again when the results are needed. Some of the exercises 
also depend on earlier asterisked material, but in such cases the dependence is obvious. 

Sets of supplementary exercises appear at the ends of several of the chapters. They 
provide an opportunity for exploration of topics that diverge somewhat from the main 
thrust of the book; an ambitious student might use one as a basis for an independent 
paper or research project. Most are fairly self-contained, but the one on topological 
groups has as a sequel a number of additional exercises on the topic that appear in later 
sections of the book. 


Possible course outlines. Most instructors who use this text for a course in general
topology will wish to cover Chapters 1-4, along with the Tychonoff theorem in Chapter 5.
Many will cover additional topics as well. Possibilities include the following:
the Stone-Čech compactification (§38), metrization theorems (Chapter 6), the Peano
curve (§44), Ascoli’s theorem (§45 and/or §47), and dimension theory (§50). I have,
in different semesters, followed each of these options. 

For a one-semester course in algebraic topology, one can expect to cover most of
Part II.

It is also possible to treat both aspects of topology in a single semester, although 
with some corresponding loss of depth. One feasible outline for such a course would 
consist of Chapters 1-3, followed by Chapter 9; the latter does not depend on the 
material of Chapter 4. (The non-asterisked sections of Chapters 10 and 13 also are 
independent of Chapter 4.) 


Comments on this edition. The reader who is familiar with the first edition of this 
book will find no substantial changes in the part of the book dealing with general 
topology. I have confined myself largely to “fine-tuning” the text material and the 
exercises. However, the final chapter of the first edition, which dealt with algebraic 
topology, has been substantially expanded and rewritten. It has become Part II of this 
book. In the years since the first edition appeared, it has become increasingly common 
to offer topology as a two-term course, the first devoted to general topology and the 
second to algebraic topology. By expanding the treatment of the latter subject, I have 
intended to make this revision serve the needs of such a course. 


Acknowledgments. Most of the topologists with whom I have studied, or whose 
books I have read, have contributed in one way or another to this book; I mention 
only Edwin Moise, Raymond Wilder, Gail Young, and Raoul Bott, but there are many 
others. For their helpful comments concerning this book, my thanks to Ken Brown, 
Russ McMillan, Robert Mosher, and John Hemperly, and to my colleagues George 
Whitehead and Kenneth Hoffman. 

The treatment of algebraic topology has been substantially influenced by the excellent
book by William Massey [M], to whom I express appreciation. Finally, thanks are
due Adam Lewenberg of MacroTeX for his extraordinary skill and patience in setting
text and juggling figures.

But most of all, to my students go my most heartfelt thanks. From them I learned
at least as much as they did from me; without them this book would be very different.

J.R.M. 


A Note to the Reader 


Two matters require comment—the exercises and the examples. 

Working problems is a crucial part of learning mathematics. No one can learn 
topology merely by poring over the definitions, theorems, and examples that are worked 
out in the text. One must work part of it out for oneself. To provide that opportunity is 
the purpose of the exercises. 

They vary in difficulty, with the easier ones usually given first. Some are routine 
verifications designed to test whether you have understood the definitions or examples 
of the preceding section. Others are less routine. You may, for instance, be asked to 
generalize a theorem of the text. Although the result obtained may be interesting in its 
own right, the main purpose of such an exercise is to encourage you to work carefully 
through the proof in question, mastering its ideas thoroughly—more thoroughly (I 
hope!) than mere memorization would demand. 

Some exercises are phrased in an “open-ended” fashion. Students often find this
practice frustrating. When faced with an exercise that asks, “Is every regular Lindelöf
space normal?” they respond in exasperation, “I don’t know what I’m supposed to do!
Am I supposed to prove it or find a counterexample or what?” But mathematics (outside
textbooks) is usually like this. More often than not, all a mathematician has to work 
with is a conjecture or question, and he or she doesn’t know what the correct answer 
is. You should have some experience with this situation. 

A few exercises that are more difficult than the rest are marked with asterisks. But 
none are so difficult but that the best student in my class can usually solve them.

Another important part of mastering any mathematical subject is acquiring a repertoire
of useful examples. One should, of course, come to know those major examples
from whose study the theory itself derives, and to which the important applications 
are made. But one should also have a few counterexamples at hand with which to test 
plausible conjectures. 

Now it is all too easy in studying topology to spend too much time dealing with 
“weird counterexamples.” Constructing them requires ingenuity and is often great 
fun. But they are not really what topology is about. Fortunately, one does not need 
too many such counterexamples for a first course; there is a fairly short list that will 
suffice for most purposes. Let me give it here: 


ℝ^J    the product of the real line with itself, in the product, uniform, and box topologies.
ℝ_ℓ    the real line in the topology having the intervals [a, b) as a basis.
S_Ω    the minimal uncountable well-ordered set.
I_o^2  the closed unit square in the dictionary order topology.


These are the examples you should master and remember; they will be exploited 
again and again. 


Part I 
GENERAL TOPOLOGY 


Chapter 1 


Set Theory and Logic 


We adopt, as most mathematicians do, the naive point of view regarding set theory. 
We shall assume that what is meant by a set of objects is intuitively clear, and we shall 
proceed on that basis without analyzing the concept further. Such an analysis properly 
belongs to the foundations of mathematics and to mathematical logic, and it is not our 
purpose to initiate the study of those fields. 


Logicians have analyzed set theory in great detail, and they have formulated axioms
for the subject. Each of their axioms expresses a property of sets that mathematicians
commonly accept, and collectively the axioms provide a foundation broad
enough and strong enough that the rest of mathematics can be built on them.


It is unfortunately true that careless use of set theory, relying on intuition alone, 
can lead to contradictions. Indeed, one of the reasons for the axiomatization of set 
theory was to formulate rules for dealing with sets that would avoid these contradictions.
Although we shall not deal with the axioms explicitly, the rules we follow in
dealing with sets derive from them. In this book, you will learn how to deal with sets 
in an “apprentice” fashion, by observing how we handle them and by working with 
them yourself. At some point of your studies, you may wish to study set theory more 
carefully and in greater detail; then a course in logic or foundations will be in order. 


§1 Fundamental Concepts 


Here we introduce the ideas of set theory, and establish the basic terminology and 
notation. We also discuss some points of elementary logic that, in our experience, are 
apt to cause confusion. 


Basic Notation 


Commonly we shall use capital letters A, B, ... to denote sets, and lowercase letters 
a, b, ... to denote the objects or elements belonging to these sets. If an object a 
belongs to a set A, we express this fact by the notation 


a ∈ A.

If a does not belong to A, we express this fact by writing

a ∉ A.


The equality symbol = is used throughout this book to mean logical identity. Thus, 
when we write a = b, we mean that “a” and “b” are symbols for the same object. This 
is what one means in arithmetic, for example, when one writes 2/4 = 1/2. Similarly, the
equation A = B states that “A” and “B” are symbols for the same set; that is, A and B 
consist of precisely the same objects. 

If a and b are different objects, we write a ≠ b; and if A and B are different sets,
we write A ≠ B. For example, if A is the set of all nonnegative real numbers, and B
is the set of all positive real numbers, then A ≠ B, because the number 0 belongs to A
and not to B.

We say that A is a subset of B if every element of A is also an element of B; and 
we express this fact by writing 


A ⊂ B.


Nothing in this definition requires A to be different from B; in fact, if A = B, it is true
that both A ⊂ B and B ⊂ A. If A ⊂ B and A is different from B, we say that A is a
proper subset of B, and we write


A ⊊ B.


The relations ⊂ and ⊊ are called inclusion and proper inclusion, respectively. If
A ⊂ B, we also write B ⊃ A, which is read “B contains A.”
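
Readers who like to experiment can try this notation on small finite sets in Python. The following is only an illustrative sketch: the two sets are made up for the example, and Python's `<=` and `<` on sets are used as stand-ins for inclusion and proper inclusion.

```python
# Hypothetical finite sets chosen only for illustration.
A = {1, 2}
B = {1, 2, 3}

is_element = 1 in A           # 1 ∈ A
is_not_element = 3 not in A   # 3 ∉ A
is_subset = A <= B            # inclusion; equality is allowed
is_proper_subset = A < B      # proper inclusion
self_subset = B <= B          # every set is a subset of itself
self_proper = B < B           # but no set is a proper subset of itself

print(is_element, is_not_element, is_subset, is_proper_subset, self_subset, self_proper)
```

Note that inclusion in the sense of the text corresponds to `<=`, not `<`, since A is allowed to equal B.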

How does one go about specifying a set? If the set has only a few elements, one 
can simply list the objects in the set, writing “A is the set consisting of the elements a, 
b, and c.” In symbols, this statement becomes 


A = {a,b,c}, 


where braces are used to enclose the list of elements.

The usual way to specify a set, however, is to take some set A of objects and some
property that elements of A may or may not possess, and to form the set consisting 
of all elements of A having that property. For instance, one might take the set of 
real numbers and form the subset B consisting of all even integers. In symbols, this 
statement becomes 


B = {x | x is an even integer}. 


Here the braces stand for the words “the set of,” and the vertical bar stands for the 
words “such that.” The equation is read “B is the set of all x such that x is an even
integer.” 
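
Set-builder notation has a close analogue in Python's set comprehensions. The sketch below assumes a finite stand-in for the ambient set (the integers from -4 to 4, an arbitrary choice), since the set of all real numbers cannot be enumerated.

```python
# Assumption: a small finite ambient set replaces the real numbers.
ambient = range(-4, 5)

# B = {x | x is an even integer}, restricted to the ambient set.
B = {x for x in ambient if x % 2 == 0}

print(sorted(B))
```

The comprehension reads almost word for word like the symbolic statement: the set of all x in the ambient set such that x is even.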


The Union of Sets and the Meaning of “or” 


Given two sets A and B, one can form a set from them that consists of all the elements 
of A together with all the elements of B. This set is called the union of A and B and
is denoted by A ∪ B. Formally, we define


A ∪ B = {x | x ∈ A or x ∈ B}.


But we must pause at this point and make sure exactly what we mean by the statement
“x ∈ A or x ∈ B.”

In ordinary everyday English, the word “or” is ambiguous. Sometimes the statement
“P or Q” means “P or Q, or both” and sometimes it means “P or Q, but not
both.” Usually one decides from the context which meaning is intended. For example, 
suppose I spoke to two students as follows: 


“Miss Smith, every student registered for this course has taken either a course in 
linear algebra or a course in analysis.”


“Mr. Jones, either you get a grade of at least 70 on the final exam or you will flunk
this course.”


In the context, Miss Smith knows perfectly well that I mean “everyone has had linear 
algebra or analysis, or both,” and Mr. Jones knows I mean “either he gets at least 70 
or he flunks, but not both.” Indeed, Mr. Jones would be exceedingly unhappy if both 
statements turned out to be true! 

In mathematics, one cannot tolerate such ambiguity. One has to pick just one 
meaning and stick with it, or confusion will reign. Accordingly, mathematicians have 
agreed that they will use the word “or” in the first sense, so that the statement “P or Q”
always means “P or Q, or both.” If one means “P or Q, but not both,” then one has to 
include the phrase “but not both” explicitly. 
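
Python's `or` happens to follow the same inclusive convention, which gives a quick way to compare the two meanings. In this sketch the sets A and B are arbitrary examples, and the exclusive sense is modeled by `!=` on truth values.

```python
from itertools import product

# Compare the inclusive and exclusive senses of "or" on all truth values.
for P, Q in product([True, False], repeat=2):
    inclusive = P or Q   # "P or Q, or both"
    exclusive = P != Q   # "P or Q, but not both"
    print(P, Q, inclusive, exclusive)

# Arbitrary example sets and a small range of candidate elements.
A = {1, 2, 3}
B = {3, 4}
candidates = (1, 2, 3, 4, 5)

# The union uses the inclusive sense: 3 lies in both sets and is included.
union = {x for x in candidates if x in A or x in B}
# The exclusive sense would instead give the symmetric difference.
either_not_both = {x for x in candidates if (x in A) != (x in B)}
```

The two senses differ only in the row where both P and Q are true, which is exactly the element 3 in the set example.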

With this understanding, the equation defining A ∪ B is unambiguous; it states that
A ∪ B is the set consisting of all elements x that belong to A or to B or to both.


The Intersection of Sets, the Empty Set, and the Meaning of “If ... Then”


Given sets A and B, another way one can form a set is to take the common part of A 
and B. This set is called the intersection of A and B and is denoted by A ∩ B. Formally,
we define


A ∩ B = {x | x ∈ A and x ∈ B}.


But just as with the definition of A ∪ B, there is a difficulty. The difficulty is not in the
meaning of the word “and”; it is of a different sort. It arises when the sets A and B
happen to have no elements in common. What meaning does the symbol A ∩ B have
in such a case? 

To take care of this eventuality, we make a special convention. We introduce a 
special set that we call the empty set, denoted by Ø, which we think of as “the set 
having no elements.” 

Using this convention, we express the statement that A and B have no elements in 
common by the equation 


A ∩ B = Ø.


We also express this fact by saying that A and B are disjoint. 

Now some students are bothered by the notion of an “empty set.” “How,” they say, 
“can you have a set with nothing in it?” The problem is similar to that which arose 
many years ago when the number 0 was first introduced. 

The empty set is only a convention, and mathematics could very well get along 
without it. But it is a very convenient convention, for it saves us a good deal of 
awkwardness in stating theorems and in proving them. Without this convention, for 
instance, one would have to prove that the two sets A and B do have elements in 
common before one could use the notation A ∩ B. Similarly, the notation


C = {x | x ∈ A and x has a certain property}


could not be used if it happened that no element x of A had the given property. It is
much more convenient to agree that A ∩ B and C equal the empty set in such cases.

Since the empty set Ø is merely a convention, we must make conventions relating 
it to the concepts already introduced. Because Ø is thought of as “the set with no 
elements,” it is clear we should make the convention that for each object x, the relation 
x ∈ Ø does not hold. Similarly, the definitions of union and intersection show that for
every set A we should have the equations


A ∪ Ø = A and A ∩ Ø = Ø.
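
These conventions agree with the way Python treats its empty set, written `set()`. The following sketch uses two arbitrary disjoint sets and a few arbitrary test objects.

```python
# Arbitrary example sets with no elements in common.
A = {1, 2}
B = {3, 4}
empty = set()

disjoint = (A & B == empty)                                 # A ∩ B = Ø
no_members = not any(x in empty for x in (0, "a", None))    # x ∈ Ø never holds
union_rule = (A | empty == A)                               # A ∪ Ø = A
intersection_rule = (A & empty == empty)                    # A ∩ Ø = Ø

print(disjoint, no_members, union_rule, intersection_rule)
```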


The inclusion relation is a bit more tricky. Given a set A, should we agree that 
Ø ⊂ A? Once more, we must be careful about the way mathematicians use the English
language. The expression Ø ⊂ A is a shorthand way of writing the sentence, “Every
element that belongs to the empty set also belongs to the set A.” Or to put it more
formally, “For every object x, if x belongs to the empty set, then x also belongs to the
set A.” 

Is this statement true or not? Some might say “yes” and others say “no.” You 
will never settle the question by argument, only by agreement. This is a statement of 
the form “If P, then Q,” and in everyday English the meaning of the “if ... then” 
construction is ambiguous. It always means that if P is true, then Q is true also. 
Sometimes that is all it means; other times it means something more: that if P is false, 
Q must be false. Usually one decides from the context which interpretation is correct. 

The situation is similar to the ambiguity in the use of the word “or.” One can reformulate
the examples involving Miss Smith and Mr. Jones to illustrate the ambiguity.
Suppose I said the following: 


“Miss Smith, if any student registered for this course has not taken a course in 
linear algebra, then he has taken a course in analysis.” 


“Mr. Jones, if you get a grade below 70 on the final, you are going to flunk this 
course.” 


In the context, Miss Smith understands that if a student in the course has not had linear 
algebra, then he has taken analysis, but if he has had linear algebra, he may or may not 
have taken analysis as well. And Mr. Jones knows that if he gets a grade below 70, he 
will flunk the course, but if he gets a grade of at least 70, he will pass. 

Again, mathematics cannot tolerate ambiguity, so a choice of meanings must be 
made. Mathematicians have agreed always to use “if ... then” in the first sense, so 
that a statement of the form “If P, then Q” means that if P is true, Q is true also, but 
if P is false, Q may be either true or false. 

As an example, consider the following statement about real numbers: 


If x > 0, then x³ ≠ 0.


It is a statement of the form, “If P, then Q,” where P is the phrase “x > 0” (called
the hypothesis of the statement) and Q is the phrase “x³ ≠ 0” (called the conclusion
of the statement). This is a true statement, for in every case for which the hypothesis
x > 0 holds, the conclusion x³ ≠ 0 holds as well.

Another true statement about real numbers is the following: 


If x² < 0, then x = 23;


in every case for which the hypothesis holds, the conclusion holds as well. Of course, 
it happens in this example that there are no cases for which the hypothesis holds. A 
statement of this sort is sometimes said to be vacuously true. 
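
The mathematical reading of “if ... then” can be written as a small truth function. The sketch below is an illustration only: the helper name `implies` and the finite sample of real numbers are assumptions made for the example, and checking a finite sample illustrates the two statements without proving them.

```python
def implies(p: bool, q: bool) -> bool:
    """Truth value of "If p, then q": false only when p is true and q is false."""
    return (not p) or q

# Hypothetical finite sample of real numbers.
sample = [-2.0, -0.5, 0.0, 0.5, 3.0, 23.0]

# "If x > 0, then x^3 != 0" holds at every sample point.
first_holds = all(implies(x > 0, x**3 != 0) for x in sample)

# "If x^2 < 0, then x = 23" also holds, because the hypothesis never does.
second_holds = all(implies(x**2 < 0, x == 23) for x in sample)
hypothesis_occurs = any(x**2 < 0 for x in sample)

print(first_holds, second_holds, hypothesis_occurs)
```

The second statement holds at every sample point even though its hypothesis is satisfied at none of them, which is what the term “vacuously true” describes.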

To return now to the empty set and inclusion, we see that the inclusion Ø ⊂ A
does hold for every set A. Writing Ø ⊂ A is the same as saying, “If x ∈ Ø, then
x ∈ A,” and this statement is vacuously true.


Contrapositive and Converse


Our discussion of the “if... then” construction leads us to consider another point of 
elementary logic that sometimes causes difficulty. It concerns the relation between a 
statement, its contrapositive, and its converse. 

Given a statement of the form “If P, then Q,” its contrapositive is defined to be 
the statement “If Q is not true, then P is not true.” For example, the contrapositive of 
the statement 


If x > 0, then x³ ≠ 0,

is the statement

If x³ = 0, then it is not true that x > 0.

Note that both the statement and its contrapositive are true. Similarly, the statement

If x² < 0, then x = 23,

has as its contrapositive the statement

If x ≠ 23, then it is not true that x² < 0.


Again, both are true statements about real numbers. 

These examples may make you suspect that there is some relation between a statement
and its contrapositive. And indeed there is; they are two ways of saying precisely
the same thing. Each is true if and only if the other is true; they are logically equivalent.

This fact is not hard to demonstrate. Let us introduce some notation first. As a 
shorthand for the statement “If P, then Q,” we write


P ⇒ Q,

which is read “P implies Q.” The contrapositive can then be expressed in the form

(not Q) ⇒ (not P),


where “not Q” stands for the phrase “Q is not true.”

Now the only way in which the statement “P ⇒ Q” can fail to be correct is if the
hypothesis P is true and the conclusion Q is false. Otherwise it is correct. Similarly,
the only way in which the statement (not Q) ⇒ (not P) can fail to be correct is if
the hypothesis “not Q” is true and the conclusion “not P” is false. This is the same
as saying that Q is false and P is true. And this, in turn, is precisely the situation in
which P ⇒ Q fails to be correct. Thus, we see that the two statements are either both
correct or both incorrect; they are logically equivalent. Therefore, we shall accept a
proof of the statement “not Q ⇒ not P” as a proof of the statement “P ⇒ Q.”

There is another statement that can be formed from the statement P ⇒ Q. It is
the statement


Q ⇒ P,

which is called the converse of P ⇒ Q. One must be careful to distinguish between a
statement’s converse and its contrapositive. Whereas a statement and its contrapositive 
are logically equivalent, the truth of a statement says nothing at all about the truth or 
falsity of its converse. For example, the true statement 


If x > 0, then x³ ≠ 0,

has as its converse the statement

If x³ ≠ 0, then x > 0,

which is false. Similarly, the true statement

If x² < 0, then x = 23,

has as its converse the statement

If x = 23, then x² < 0,


which is false. 
If it should happen that both the statement P ⇒ Q and its converse Q ⇒ P are
true, we express this fact by the notation


P ⇔ Q,


which is read “P holds if and only if Q holds.” 
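
The relations among a statement, its contrapositive, and its converse can be checked mechanically by running through all four combinations of truth values for P and Q. This is a minimal sketch; the helper `implies` is a hypothetical name introduced here, encoding the convention that an implication fails only when its hypothesis is true and its conclusion is false.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    """An implication fails only when p is true and q is false."""
    return (not p) or q

rows = list(product([True, False], repeat=2))

# A statement and its contrapositive agree in every row of the truth table.
contrapositive_equivalent = all(
    implies(p, q) == implies(not q, not p) for p, q in rows
)

# A statement and its converse do not: they differ when exactly one of P, Q is true.
converse_equivalent = all(implies(p, q) == implies(q, p) for p, q in rows)

# Both implications hold together exactly when P and Q have the same truth value.
iff_matches = all((implies(p, q) and implies(q, p)) == (p == q) for p, q in rows)

print(contrapositive_equivalent, converse_equivalent, iff_matches)
```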


Negation 


If one wishes to form the contrapositive of the statement P ⇒ Q, one has to know
how to form the statement “not P,” which is called the negation of P. In many cases, 
this causes no difficulty; but sometimes confusion occurs with statements involving the 
phrases “for every” and “for at least one.” These phrases are called logical quantifiers. 

To illustrate, suppose that X is a set, A is a subset of X, and P is a statement about 
the general element of X. Consider the following statement: 


(*) For every x ∈ A, statement P holds.


How does one form the negation of this statement? Let us translate the problem into 
the language of sets. Suppose that we let B denote the set of all those elements x 
of X for which P holds. Then statement (*) is just the statement that A is a subset 
of B. What is its negation? Obviously, the statement that A is not a subset of B; that 
is, the statement that there exists at least one element of A that does not belong to B. 
Translating back into ordinary language, this becomes 


For at least one x ∈ A, statement P does not hold.


Therefore, to form the negation of statement (*), one replaces the quantifier “for every” 
by the quantifier “for at least one,” and one replaces statement P by its negation.

The process works in reverse just as well; the negation of the statement

For at least one x ∈ A, statement Q holds,

is the statement


For every x ∈ A, statement Q does not hold.
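
For finite sets, the two quantifiers correspond to Python's built-in `all` and `any`, so the negation rules can be tested directly. In this sketch the statement P (“x is even”) and the example sets are hypothetical choices made for illustration.

```python
def is_even(x: int) -> bool:
    """Hypothetical statement P about an element x."""
    return x % 2 == 0

def negation_rules_hold(A, P) -> bool:
    # not (for every x in A, P holds)  ==  for at least one x in A, P fails
    rule_every = (not all(P(x) for x in A)) == any(not P(x) for x in A)
    # not (for at least one x in A, P holds)  ==  for every x in A, P fails
    rule_some = (not any(P(x) for x in A)) == all(not P(x) for x in A)
    return rule_every and rule_some

# Arbitrary example sets, including the empty one.
examples = [[2, 4, 6], [1, 2, 3], [1, 3, 5], []]
results = [negation_rules_hold(A, is_even) for A in examples]
print(results)
```

The empty list is included on purpose: “for every x” is vacuously true there and “for at least one x” is false, and the rules still hold.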


The Difference of Two Sets 


We return now to our discussion of sets. There is one other operation on sets that is 
occasionally useful. It is the difference of two sets, denoted by A - B, and defined as
the set consisting of those elements of A that are not in B. Formally,


A - B = {x | x ∈ A and x ∉ B}.


It is sometimes called the complement of B relative to A, or the complement of B in A. 
Our three set operations are represented schematically in Figure 1.1. 


[Three diagrams of sets A and B, labeled A ∪ B, A ∩ B, and A - B]


Figure 1.1
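
The three operations can also be compared side by side on a small example. The sets below are arbitrary, and the difference is computed from its defining property before being compared with Python's built-in `-` operator.

```python
# Arbitrary example sets that overlap in 3 and 4.
A = {1, 2, 3, 4}
B = {3, 4, 5}

union = A | B
intersection = A & B
# A - B = {x | x ∈ A and x ∉ B}, written out from the definition.
difference = {x for x in A if x not in B}

print(sorted(union), sorted(intersection), sorted(difference))
```

Unlike union and intersection, the difference is not symmetric: here A - B and B - A are different sets.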


Rules of Set Theory 


Given several sets, one may form new sets by applying the set-theoretic operations to 
them. As in algebra, one uses parentheses to indicate in what order the operations are 
to be performed. For example, A ∪ (B ∩ C) denotes the union of the two sets A and
B ∩ C, while (A ∪ B) ∩ C denotes the intersection of the two sets A ∪ B and C. The
sets thus formed are quite different, as Figure 1.2 shows. 


[Two diagrams, labeled A ∪ (B ∩ C) and (A ∪ B) ∩ C]


Figure 1.2

Sometimes different combinations of operations lead to the same set; when that
happens, one has a rule of set theory. For instance, it is true that for any sets A, B, 
and C the equation 


A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)


holds. The equation is illustrated in Figure 1.3; the shaded region represents the set in
question, as you can check mentally. This equation can be thought of as a “distributive
law” for the operations ∩ and ∪.


[Diagram of sets A, B, and C with the region A ∩ (B ∪ C) shaded]


Figure 1.3


Other examples of set-theoretic rules include the second “distributive law,”

A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C),

and DeMorgan’s laws,


A - (B ∪ C) = (A - B) ∩ (A - C),
A - (B ∩ C) = (A - B) ∪ (A - C).


We leave it to you to check these rules. One can state other rules of set theory, but 
these are the most important ones. DeMorgan’s laws are easier to remember if you 
verbalize them as follows: 


The complement of the union equals the intersection of the complements. 
The complement of the intersection equals the union of the complements. 
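
One way to gain confidence in these rules before proving them is to test them on every triple of subsets of a small set. The sketch below does this for a three-element set chosen arbitrarily; the helper names `subsets` and `rules_hold` are introduced only for this example. Such a finite check is evidence, not a proof, since the rules are claimed for all sets.

```python
from itertools import combinations, product

def subsets(universe):
    """List every subset of a finite set."""
    items = list(universe)
    return [set(c) for r in range(len(items) + 1) for c in combinations(items, r)]

def rules_hold(A, B, C) -> bool:
    """Check both distributive laws and both DeMorgan laws for one triple."""
    return (
        A & (B | C) == (A & B) | (A & C)
        and A | (B & C) == (A | B) & (A | C)
        and A - (B | C) == (A - B) & (A - C)
        and A - (B & C) == (A - B) | (A - C)
    )

# Arbitrary small universe: 8 subsets, hence 512 triples to test.
universe = {0, 1, 2}
all_rules_hold = all(
    rules_hold(A, B, C) for A, B, C in product(subsets(universe), repeat=3)
)
print(all_rules_hold)
```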


Collections of Sets 


The objects belonging to a set may be of any sort. One can consider the set of all even 
integers, and the set of all blue-eyed people in Nebraska, and the set of all decks of 
playing cards in the world. Some of these are of limited mathematical interest, we 
admit! But the third example illustrates a point we have not yet mentioned: namely, 
that the objects belonging to a set may themselves be sets. For a deck of cards is itself 
a set, one consisting of pieces of pasteboard with certain standard designs printed on 
them. The set of all decks of cards in the world is thus a set whose elements are 
themselves sets (of pieces of pasteboard).

We now have another way to form new sets from old ones. Given a set A, we can
consider sets whose elements are subsets of A. In particular, we can consider the set
of all subsets of A. This set is sometimes denoted by the symbol 𝒫(A) and is called
the power set of A (for reasons to be explained later).

When we have a set whose elements are sets, we shall often refer to it as a collection
of sets and denote it by a script letter such as 𝒜 or ℬ. This device will help us
in keeping things straight in arguments where we have to consider objects, and sets of
objects, and collections of sets of objects, all at the same time. For example, we might
use 𝒜 to denote the collection of all decks of cards in the world, letting an ordinary
capital letter A denote a deck of cards and a lowercase letter a denote a single playing
card.

A certain amount of care with notation is needed at this point. We make a distinction
between the object a, which is an element of a set A, and the one-element set {a},
which is a subset of A. To illustrate, if A is the set {a, b, c}, then the statements


a ∈ A, {a} ⊂ A, and {a} ∈ 𝒫(A)


are all correct, but the statements {a} ∈ A and a ⊂ A are not.
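
The distinction between an element and a one-element set can be made concrete for a small set. In the sketch below, `power_set` is a hypothetical helper written for this example, and the subsets are stored as `frozenset` objects only because Python requires the elements of a set to be hashable.

```python
from itertools import combinations

def power_set(A):
    """Return the collection of all subsets of a finite set A."""
    items = list(A)
    return {
        frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)
    }

A = {"a", "b", "c"}
PA = power_set(A)

element_of_A = "a" in A                      # a ∈ A
singleton_subset = {"a"} <= A                # {a} ⊂ A
singleton_in_PA = frozenset({"a"}) in PA     # {a} ∈ 𝒫(A)
singleton_in_A = frozenset({"a"}) in A       # {a} ∈ A is not correct

print(len(PA), element_of_A, singleton_subset, singleton_in_PA, singleton_in_A)
```

For this three-element set the collection has eight members, including the empty set and A itself.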


Arbitrary Unions and Intersections


We have already defined what we mean by the union and the intersection of two sets.
There is no reason to limit ourselves to just two sets, for we can just as well form the
union and intersection of arbitrarily many sets.

Given a collection 𝒜 of sets, the union of the elements of 𝒜 is defined by the
equation


⋃_{A ∈ 𝒜} A = {x | x ∈ A for at least one A ∈ 𝒜}.


The intersection of the elements of 𝒜 is defined by the equation


⋂_{A ∈ 𝒜} A = {x | x ∈ A for every A ∈ 𝒜}.


There is no problem with these definitions if one of the elements of $\mathcal{A}$ happens to be
the empty set. But it is a bit tricky to decide what (if anything) these definitions mean
if we allow $\mathcal{A}$ to be the empty collection. Applying the definitions literally, we see that
no element $x$ satisfies the defining property for the union of the elements of $\mathcal{A}$. So it is
reasonable to say that

$$\bigcup_{A \in \mathcal{A}} A = \varnothing$$

if $\mathcal{A}$ is empty. On the other hand, every $x$ satisfies (vacuously) the defining property for
the intersection of the elements of $\mathcal{A}$. The question is, every $x$ in what set? If one has a
given large set $X$ that is specified at the outset of the discussion to be one’s “universe of
discourse,” and one considers only subsets of $X$ throughout, it is reasonable to let

$$\bigcap_{A \in \mathcal{A}} A = X$$

when $\mathcal{A}$ is empty. Not all mathematicians follow this convention, however. To avoid
difficulty, we shall not define the intersection when $\mathcal{A}$ is empty.
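The conventions just described can be mirrored in a short Python sketch. It is an illustration only; the function names `union_all` and `intersection_all` are hypothetical, and raising an error for the empty collection is our way of modeling "we shall not define the intersection" in that case.

```python
def union_all(collection):
    """Union of the elements of a collection of sets (empty collection gives the empty set)."""
    result = set()
    for member in collection:
        result |= member
    return result

def intersection_all(collection):
    """Intersection of the elements of a nonempty collection of sets."""
    members = list(collection)
    if not members:
        # Following the text, the intersection of the empty collection is left undefined.
        raise ValueError("intersection of the empty collection is not defined")
    result = set(members[0])
    for member in members[1:]:
        result &= member
    return result

collection = [{1, 2, 3}, {2, 3, 4}, {3, 5}]
union_result = union_all(collection)
intersection_result = intersection_all(collection)
empty_union = union_all([])
```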


Cartesian Products 


There is yet another way of forming new sets from old ones; it involves the notion of an 
“ordered pair” of objects. When you studied analytic geometry, the first thing you did 
was to convince yourself that after one has chosen an x-axis and a y-axis in the plane, 
every point in the plane can be made to correspond to a unique ordered pair (x, y) of 
real numbers. (In a more sophisticated treatment of geometry, the plane is more likely 
to be defined as the set of all ordered pairs of real numbers!) 

The notion of ordered pair carries over to general sets. Given sets $A$ and $B$, we
define their cartesian product $A \times B$ to be the set of all ordered pairs $(a, b)$ for which $a$
is an element of $A$ and $b$ is an element of $B$. Formally,

$$A \times B = \{(a, b) \mid a \in A \text{ and } b \in B\}.$$


This definition assumes that the concept of “ordered pair” is already given. It can be 
taken as a primitive concept, as was the notion of “set”; or it can be given a definition in 
terms of the set operations already introduced. One definition in terms of set operations is 
expressed by the equation 


(a, b) = {{a}, {a, b}}; 


it defines the ordered pair $(a, b)$ as a collection of sets. If $a \neq b$, this definition says that
$(a, b)$ is a collection containing two sets, one of which is a one-element set and the other
a two-element set. The first coordinate of the ordered pair is defined to be the element
belonging to both sets, and the second coordinate is the element belonging to only one of
the sets. If $a = b$, then $(a, b)$ is a collection containing only one set $\{a\}$, since $\{a, b\} =
\{a, a\} = \{a\}$ in this case. Its first coordinate and second coordinate both equal the element
in this single set.

I think it is fair to say that most mathematicians think of an ordered pair as a primitive 
concept rather than thinking of it as a collection of sets! 
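For readers who like to experiment, the set-theoretic definition of the ordered pair can be tried out in Python. This is a sketch under our own naming (`pair`, `first`, `second` are hypothetical helpers), with `frozenset` standing in for sets so that sets may themselves be elements of sets.

```python
def pair(a, b):
    """The ordered pair (a, b) encoded as the collection {{a}, {a, b}}."""
    return frozenset({frozenset({a}), frozenset({a, b})})

def first(p):
    """First coordinate: the element belonging to every set in the collection."""
    common = frozenset.intersection(*p)
    return next(iter(common))

def second(p):
    """Second coordinate: the element belonging to only one of the sets,
    or the single element when the collection is {{a}} (the case a = b)."""
    common = frozenset.intersection(*p)
    everything = frozenset.union(*p)
    rest = everything - common
    return next(iter(rest)) if rest else next(iter(common))

p = pair(1, 2)
q = pair(2, 1)
d = pair(3, 3)
```

Here `p` and `q` are different collections, so the encoding does record the order of the coordinates, and `d` collapses to the one-set collection described above.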


Let us make a comment on notation. It is an unfortunate fact that the notation (a, b) 
is firmly established in mathematics with two entirely different meanings. One mean- 
ing, as an ordered pair of objects, we have just discussed. The other meaning is the 
one you are familiar with from analysis; if a and b are real numbers, the symbol (a, b) 
is used to denote the interval consisting of all numbers $x$ such that $a < x < b$. Most of
the time, this conflict in notation will cause no difficulty because the meaning will be 
clear from the context. Whenever a situation occurs where confusion is possible, we 
shall adopt a different notation for the ordered pair (a, b), denoting it by the symbol 


$$a \times b$$


instead. 
Exercises


1. Check the distributive laws for $\cup$ and $\cap$ and DeMorgan’s laws.


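2. Determine which of the following statements are true for all sets $A$, $B$, $C$, and $D$.
If a double implication fails, determine whether one or the other of the possible
implications holds. If an equality fails, determine whether the statement becomes
true if the “equals” symbol is replaced by one or the other of the inclusion
symbols $\subset$ or $\supset$.
(a) $A \subset B$ and $A \subset C \Leftrightarrow A \subset (B \cup C)$.
(b) $A \subset B$ or $A \subset C \Leftrightarrow A \subset (B \cup C)$.
(c) $A \subset B$ and $A \subset C \Leftrightarrow A \subset (B \cap C)$.
(d) $A \subset B$ or $A \subset C \Leftrightarrow A \subset (B \cap C)$.
(e) $A - (A - B) = B$.
(f) $A - (B - A) = A - B$.
(g) $A \cap (B - C) = (A \cap B) - (A \cap C)$.
(h) $A \cup (B - C) = (A \cup B) - (A \cup C)$.
(i) $(A \cap B) \cup (A - B) = A$.
(j) $A \subset C$ and $B \subset D \Rightarrow (A \times B) \subset (C \times D)$.
(k) The converse of (j).
(l) The converse of (j), assuming that $A$ and $B$ are nonempty.
(m) $(A \times B) \cup (C \times D) = (A \cup C) \times (B \cup D)$.
(n) $(A \times B) \cap (C \times D) = (A \cap C) \times (B \cap D)$.
(o) $A \times (B - C) = (A \times B) - (A \times C)$.
(p) $(A - B) \times (C - D) = (A \times C - B \times C) - A \times D$.
(q) $(A \times B) - (C \times D) = (A - C) \times (B - D)$.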


3. (a) Write the contrapositive and converse of the following statement: “If $x < 0$,
then $x^2 - x > 0$,” and determine which (if any) of the three statements are
true.
(b) Do the same for the statement “If $x > 0$, then $x^2 - x > 0$.”


4. Let $A$ and $B$ be sets of real numbers. Write the negation of each of the following
statements:
(a) For every $a \in A$, it is true that $a^2 \in B$.
(b) For at least one $a \in A$, it is true that $a^2 \in B$.
(c) For every $a \in A$, it is true that $a^2 \notin B$.
(d) For at least one $a \notin A$, it is true that $a^2 \in B$.


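5. Let $\mathcal{A}$ be a nonempty collection of sets. Determine the truth of each of the
following statements and of their converses:
(a) $x \in \bigcup_{A \in \mathcal{A}} A \Rightarrow x \in A$ for at least one $A \in \mathcal{A}$.
(b) $x \in \bigcup_{A \in \mathcal{A}} A \Rightarrow x \in A$ for every $A \in \mathcal{A}$.
(c) $x \in \bigcap_{A \in \mathcal{A}} A \Rightarrow x \in A$ for at least one $A \in \mathcal{A}$.
(d) $x \in \bigcap_{A \in \mathcal{A}} A \Rightarrow x \in A$ for every $A \in \mathcal{A}$.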


6. Write the contrapositive of each of the statements of Exercise 5. 
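7. Given sets $A$, $B$, and $C$, express each of the following sets in terms of $A$, $B$,
and $C$, using the symbols $\cup$, $\cap$, and $-$.

$$\begin{aligned}
D &= \{x \mid x \in A \text{ and } (x \in B \text{ or } x \in C)\}, \\
E &= \{x \mid (x \in A \text{ and } x \in B) \text{ or } x \in C\}, \\
F &= \{x \mid x \in A \text{ and } (x \in B \Rightarrow x \in C)\}.
\end{aligned}$$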


8. If a set A has two elements, show that P(A) has four elements. How many 
elements does P(A) have if A has one element? Three elements? No elements? 
Why is P(A) called the power set of A? 


9. Formulate and prove DeMorgan’s laws for arbitrary unions and intersections. 


10. Let $\mathbb{R}$ denote the set of real numbers. For each of the following subsets of $\mathbb{R} \times \mathbb{R}$,
determine whether it is equal to the cartesian product of two subsets of $\mathbb{R}$.
(a) $\{(x, y) \mid x \text{ is an integer}\}$.
(b) $\{(x, y) \mid 0 < y \leq 1\}$.
(c) $\{(x, y) \mid y > x\}$.
(d) $\{(x, y) \mid x \text{ is not an integer and } y \text{ is an integer}\}$.
(e) $\{(x, y) \mid x^2 + y^2 < 1\}$.


§2 Functions 


The concept of function is one you have seen many times already, so it is hardly nec- 
essary to remind you how central it is to all mathematics. In this section, we give the 
precise mathematical definition, and we explore some of the associated concepts. 

A function is usually thought of as a rule that assigns to each element of a set $A$
an element of a set $B$. In calculus, a function is often given by a simple formula such
as $f(x) = 3x^2 + 2$ or perhaps by a more complicated formula such as

$$f(x) = \sum_{k=1}^{\infty} x^k.$$


One often does not even mention the sets A and B explicitly, agreeing to take A to be 
the set of all real numbers for which the rule makes sense and B to be the set of all real 
numbers. 

As one goes further in mathematics, however, one needs to be more precise about 
what a function is. Mathematicians think of functions in the way we just described, 
but the definition they use is more exact. First, we define the following: 


Definition. A rule of assignment is a subset $r$ of the cartesian product $C \times D$ of two
sets, having the property that each element of $C$ appears as the first coordinate of at
most one ordered pair belonging to $r$.

Thus, a subset $r$ of $C \times D$ is a rule of assignment if

$$[(c, d) \in r \text{ and } (c, d') \in r] \Longrightarrow [d = d'].$$

We think of $r$ as a way of assigning, to the element $c$ of $C$, the element $d$ of $D$ for
which $(c, d) \in r$.

Given a rule of assignment r, the domain of r is defined to be the subset of C 
consisting of all first coordinates of elements of r, and the image set of r is defined as 
the subset of D consisting of all second coordinates of elements of r. Formally, 


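$$\begin{aligned}
\text{domain } r &= \{c \mid \text{there exists } d \in D \text{ such that } (c, d) \in r\}, \\
\text{image } r &= \{d \mid \text{there exists } c \in C \text{ such that } (c, d) \in r\}.
\end{aligned}$$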
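For finite sets, these definitions translate almost word for word into code. The following Python sketch is offered only as an illustration; the names `is_rule_of_assignment`, `domain`, and `image` are ours, and a rule is represented as a set of pairs.

```python
def is_rule_of_assignment(r):
    """Check that no first coordinate appears with two different second coordinates."""
    assigned = {}
    for c, d in r:
        if c in assigned and assigned[c] != d:
            return False
        assigned[c] = d
    return True

def domain(r):
    """All first coordinates of elements of r."""
    return {c for c, _ in r}

def image(r):
    """All second coordinates of elements of r."""
    return {d for _, d in r}

r = {(1, "x"), (2, "y"), (3, "x")}   # a rule of assignment
s = {(1, "x"), (1, "y")}             # not one: 1 is paired with both "x" and "y"
```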


Note that given a rule of assignment r, its domain and image are entirely determined. 
Now we can say what a function is. 


Definition. A function $f$ is a rule of assignment $r$, together with a set $B$ that contains
the image set of $r$. The domain $A$ of the rule $r$ is also called the domain of the
function $f$; the image set of $r$ is also called the image set of $f$; and the set $B$ is called
the range of $f$.†


If $f$ is a function having domain $A$ and range $B$, we express this fact by writing

$$f : A \to B,$$


which is read “f is a function from A to B,” or “f is a mapping from A into B,” or 
simply “f maps A into B.” One sometimes visualizes f as a geometric transformation 
physically carrying the points of A to points of B. 

If $f : A \to B$ and if $a$ is an element of $A$, we denote by $f(a)$ the unique element
of $B$ that the rule determining $f$ assigns to $a$; it is called the value of $f$ at $a$, or
sometimes the image of $a$ under $f$. Formally, if $r$ is the rule of the function $f$, then
$f(a)$ denotes the unique element of $B$ such that $(a, f(a)) \in r$.

Using this notation, one can go back to defining functions almost as one did before, 
with no lack of rigor. For instance, one can write (letting R denote the real numbers) 


“Let $f$ be the function whose rule is $\{(x, x^3 + 1) \mid x \in \mathbb{R}\}$ and whose
range is $\mathbb{R}$,”

or one can equally well write

“Let $f : \mathbb{R} \to \mathbb{R}$ be the function such that $f(x) = x^3 + 1$.”

Both sentences specify precisely the same function. But the sentence “Let $f$ be the
function $f(x) = x^3 + 1$” is no longer adequate for specifying a function because it
specifies neither the domain nor the range of $f$.


† Analysts are apt to use the word “range” to denote what we have called the “image set” of $f$.
They avoid giving the set $B$ a name.
Definition. If $f : A \to B$ and if $A_0$ is a subset of $A$, we define the restriction of $f$
to $A_0$ to be the function mapping $A_0$ into $B$ whose rule is

$$\{(a, f(a)) \mid a \in A_0\}.$$

It is denoted by $f|A_0$, which is read “$f$ restricted to $A_0$.”


EXAMPLE 1. Let $\mathbb{R}$ denote the real numbers and let $\mathbb{R}_+$ denote the nonnegative reals.
Consider the functions

$$\begin{aligned}
f &: \mathbb{R} \to \mathbb{R} && \text{defined by } f(x) = x^2, \\
g &: \mathbb{R}_+ \to \mathbb{R} && \text{defined by } g(x) = x^2, \\
h &: \mathbb{R} \to \mathbb{R}_+ && \text{defined by } h(x) = x^2, \\
k &: \mathbb{R}_+ \to \mathbb{R}_+ && \text{defined by } k(x) = x^2.
\end{aligned}$$

The function $g$ is different from the function $f$ because their rules are different subsets of
$\mathbb{R} \times \mathbb{R}$; it is the restriction of $f$ to the set $\mathbb{R}_+$. The function $h$ is also different from $f$, even
though their rules are the same set, because the range specified for $h$ is different from the
range specified for $f$. The function $k$ is different from all of these. These functions are
pictured in Figure 2.1.


Figure 2.1 
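The point of Example 1, that a function is a rule together with a range, can be imitated on a small finite model. In the Python sketch below, which is only an illustration, a function is represented as a pair (rule, range); the integers from $-2$ to $2$ stand in for $\mathbb{R}$, and the names `make_function` and `restrict` are our own.

```python
def make_function(domain, range_set, formula):
    """A function as a pair (rule, range), where the range must contain the image set."""
    rule = frozenset((x, formula(x)) for x in domain)
    assert all(y in range_set for _, y in rule), "range must contain the image set"
    return (rule, frozenset(range_set))

def restrict(function, subdomain):
    """Restriction of a function to a subset of its domain; the range is unchanged."""
    rule, range_set = function
    return (frozenset((a, b) for (a, b) in rule if a in subdomain), range_set)

def square(x):
    return x * x

D = range(-2, 3)        # finite stand-in for the reals (an assumption of this sketch)
D_plus = range(0, 3)    # stand-in for the nonnegative reals, as a domain
B = range(-4, 5)        # stand-in for the reals, as a range
B_plus = range(0, 5)    # stand-in for the nonnegative reals, as a range

f = make_function(D, B, square)
g = make_function(D_plus, B, square)
h = make_function(D, B_plus, square)
k = make_function(D_plus, B_plus, square)
```

In this model `g` coincides with `restrict(f, D_plus)`, while `f` and `h` have the same rule but are different functions because their ranges differ.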


Restricting the domain of a function and changing its range are two ways of form- 
ing a new function from an old one. Another way is to form the composite of two 
functions. 


Definition. Given functions $f : A \to B$ and $g : B \to C$, we define the composite
$g \circ f$ of $f$ and $g$ as the function $g \circ f : A \to C$ defined by the equation
$(g \circ f)(a) = g(f(a))$.

Formally, $g \circ f : A \to C$ is the function whose rule is

$$\{(a, c) \mid \text{for some } b \in B,\ f(a) = b \text{ and } g(b) = c\}.$$

We often picture the composite $g \circ f$ as involving a physical movement of the point $a$
to the point $f(a)$, and then to the point $g(f(a))$, as illustrated in Figure 2.2.
Note that $g \circ f$ is defined only when the range of $f$ equals the domain of $g$.


Figure 2.2 


EXAMPLE 2. The composite of the function $f : \mathbb{R} \to \mathbb{R}$ given by $f(x) = 3x^2 + 2$ and
the function $g : \mathbb{R} \to \mathbb{R}$ given by $g(x) = 5x$ is the function $g \circ f : \mathbb{R} \to \mathbb{R}$ given by

$$(g \circ f)(x) = g(f(x)) = g(3x^2 + 2) = 5(3x^2 + 2).$$

The composite $f \circ g$ can also be formed in this case; it is the quite different function
$f \circ g : \mathbb{R} \to \mathbb{R}$ given by

$$(f \circ g)(x) = f(g(x)) = f(5x) = 3(5x)^2 + 2.$$
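The two composites of Example 2 are easy to compare numerically. The Python sketch below is an illustration only; the helper `compose` is our own name, and ordinary Python callables stand in for the functions $f$ and $g$ (domains and ranges are not tracked here).

```python
def compose(g, f):
    """Return the composite g o f, which sends x to g(f(x))."""
    return lambda x: g(f(x))

def f(x):
    return 3 * x**2 + 2

def g(x):
    return 5 * x

g_after_f = compose(g, f)   # x -> 5(3x^2 + 2)
f_after_g = compose(f, g)   # x -> 3(5x)^2 + 2

value_gf = g_after_f(1)     # 5 * (3 + 2) = 25
value_fg = f_after_g(1)     # 3 * 25 + 2 = 77
```

The two values at $x = 1$ already differ, which confirms that the order of composition matters.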


Definition. A function $f : A \to B$ is said to be injective (or one-to-one) if for each
pair of distinct points of A, their images under f are distinct. It is said to be surjective 
(or f is said to map A onto B) if every element of B is the image of some element 
of A under the function f. If f is both injective and surjective, it is said to be bijective 
(or is called a one-to-one correspondence). 


More formally, $f$ is injective if

$$[f(a) = f(a')] \Longrightarrow [a = a'],$$

and $f$ is surjective if

$$[b \in B] \Longrightarrow [b = f(a) \text{ for at least one } a \in A].$$


Injectivity of f depends only on the rule of f; surjectivity depends on the range 
of f as well. You can check that the composite of two injective functions is injec- 
tive, and the composite of two surjective functions is surjective; it follows that the 
composite of two bijective functions is bijective. 

If $f$ is bijective, there exists a function from $B$ to $A$ called the inverse of $f$. It is
denoted by $f^{-1}$ and is defined by letting $f^{-1}(b)$ be that unique element $a$ of $A$ for
which $f(a) = b$. Given $b \in B$, the fact that $f$ is surjective implies that there exists
such an element $a \in A$; the fact that $f$ is injective implies that there is only one such
element $a$. It is easy to see that if $f$ is bijective, $f^{-1}$ is also bijective.


EXAMPLE 3. Consider again the functions $f$, $g$, $h$, and $k$ of Figure 2.1. The function
$f : \mathbb{R} \to \mathbb{R}$ given by $f(x) = x^2$ is neither injective nor surjective. Its restriction $g$ to the
nonnegative reals is injective but not surjective. The function $h : \mathbb{R} \to \mathbb{R}_+$ obtained from $f$
by changing the range is surjective but not injective. The function $k : \mathbb{R}_+ \to \mathbb{R}_+$ obtained
from $f$ by restricting the domain and changing the range is both injective and surjective,
so it has an inverse. Its inverse is, of course, what we usually call the square-root function.
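The four cases of Example 3 can be reproduced on a finite model. In the Python sketch below, which is an illustration rather than anything from the text, a rule is a dictionary, the range is supplied separately, and the sets `D`, `D_plus`, `B`, and `B_plus` are hypothetical finite stand-ins; in particular `B_plus` is chosen to be exactly the set of squares so that the model mirrors the surjectivity of $h$.

```python
def is_injective(rule):
    """Distinct points of the domain have distinct images."""
    values = list(rule.values())
    return len(values) == len(set(values))

def is_surjective(rule, range_set):
    """Every element of the range is the image of some element of the domain."""
    return set(rule.values()) == set(range_set)

def inverse(rule):
    """Inverse rule of a bijective function, obtained by swapping coordinates."""
    return {value: key for key, value in rule.items()}

D = range(-2, 3)          # stand-in for the reals as a domain
D_plus = range(0, 3)      # stand-in for the nonnegative reals as a domain
B = set(range(-4, 5))     # stand-in for the reals as a range
B_plus = {0, 1, 4}        # stand-in for the nonnegative reals as a range

rule_on_D = {x: x * x for x in D}            # rule of f and of h
rule_on_D_plus = {x: x * x for x in D_plus}  # rule of g and of k

square_root = inverse(rule_on_D_plus)        # inverse of k in this model
```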


A useful criterion for showing that a given function f is bijective is the following, 
whose proof is left to the exercises: 


Lemma 2.1. Let $f : A \to B$. If there are functions $g : B \to A$ and $h : B \to A$
such that $g(f(a)) = a$ for every $a$ in $A$ and $f(h(b)) = b$ for every $b$ in $B$, then $f$ is
bijective and $g = h = f^{-1}$.


Definition. Let $f : A \to B$. If $A_0$ is a subset of $A$, we denote by $f(A_0)$ the set
of all images of points of $A_0$ under the function $f$; this set is called the image of $A_0$
under $f$. Formally,

$$f(A_0) = \{b \mid b = f(a) \text{ for at least one } a \in A_0\}.$$

On the other hand, if $B_0$ is a subset of $B$, we denote by $f^{-1}(B_0)$ the set of all elements
of $A$ whose images under $f$ lie in $B_0$; it is called the preimage of $B_0$ under $f$ (or the
“counterimage,” or the “inverse image,” of $B_0$). Formally,

$$f^{-1}(B_0) = \{a \mid f(a) \in B_0\}.$$

Of course, there may be no points $a$ of $A$ whose images lie in $B_0$; in that case, $f^{-1}(B_0)$
is empty.


Note that if $f : A \to B$ is bijective and $B_0 \subset B$, we have two meanings for the
notation $f^{-1}(B_0)$. It can be taken to denote the preimage of $B_0$ under the function $f$
or to denote the image of $B_0$ under the function $f^{-1} : B \to A$. These two meanings
give precisely the same subset of $A$, however, so there is, in fact, no ambiguity.

Some care is needed if one is to use the $f$ and $f^{-1}$ notation correctly. The operation
$f^{-1}$, for instance, when applied to subsets of $B$, behaves very nicely; it preserves
inclusions, unions, intersections, and differences of sets. We shall use this fact frequently.
But the operation $f$, when applied to subsets of $A$, preserves only inclusions
and unions. See Exercises 2 and 3.

As another situation where care is needed, we note that it is not in general true that
$f^{-1}(f(A_0)) = A_0$ and $f(f^{-1}(B_0)) = B_0$. (See the following example.) The relevant
rules, which we leave to you to check, are the following: If $f : A \to B$ and if $A_0 \subset A$
and $B_0 \subset B$, then

$$A_0 \subset f^{-1}(f(A_0)) \qquad \text{and} \qquad f(f^{-1}(B_0)) \subset B_0.$$

The first inclusion is an equality if $f$ is injective, and the second inclusion is an equality
if $f$ is surjective.
EXAMPLE 4. Consider the function $f : \mathbb{R} \to \mathbb{R}$ given by $f(x) = 3x^2 + 2$ (Figure 2.3).
Let $[a, b]$ denote the closed interval $a \leq x \leq b$. Then

$$\begin{aligned}
f^{-1}(f([0, 1])) &= f^{-1}([2, 5]) = [-1, 1], \quad \text{and} \\
f(f^{-1}([0, 5])) &= f([-1, 1]) = [2, 5].
\end{aligned}$$

Figure 2.3
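A discrete analogue of Example 4 shows both inclusions being proper. The Python sketch below is only an illustration: the integers from $-3$ to $3$ replace $\mathbb{R}$ as the domain, and the helper names `image_of` and `preimage_of` are hypothetical.

```python
def image_of(f, subset):
    """f(A0): the set of all images of points of A0."""
    return {f(a) for a in subset}

def preimage_of(f, domain, subset):
    """The preimage of B0: all points of the domain whose images lie in B0."""
    return {a for a in domain if f(a) in subset}

def f(x):
    return 3 * x**2 + 2

domain = range(-3, 4)   # finite stand-in for the reals (an assumption of this sketch)

A0 = {0, 1}
B0 = {0, 2, 5}

back_and_forth_A = preimage_of(f, domain, image_of(f, A0))   # contains A0
back_and_forth_B = image_of(f, preimage_of(f, domain, B0))   # contained in B0
```

Here `back_and_forth_A` picks up the extra point $-1$ because $f$ is not injective, and `back_and_forth_B` loses the point $0$ because $0$ is not a value of $f$.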


Exercises 


1. Let $f : A \to B$. Let $A_0 \subset A$ and $B_0 \subset B$.
(a) Show that $A_0 \subset f^{-1}(f(A_0))$ and that equality holds if $f$ is injective.
(b) Show that $f(f^{-1}(B_0)) \subset B_0$ and that equality holds if $f$ is surjective.
2. Let $f : A \to B$ and let $A_i \subset A$ and $B_i \subset B$ for $i = 0$ and $i = 1$. Show that $f^{-1}$
preserves inclusions, unions, intersections, and differences of sets:
(a) $B_0 \subset B_1 \Rightarrow f^{-1}(B_0) \subset f^{-1}(B_1)$.
(b) $f^{-1}(B_0 \cup B_1) = f^{-1}(B_0) \cup f^{-1}(B_1)$.
(c) $f^{-1}(B_0 \cap B_1) = f^{-1}(B_0) \cap f^{-1}(B_1)$.
(d) $f^{-1}(B_0 - B_1) = f^{-1}(B_0) - f^{-1}(B_1)$.
Show that $f$ preserves inclusions and unions only:
(e) $A_0 \subset A_1 \Rightarrow f(A_0) \subset f(A_1)$.
(f) $f(A_0 \cup A_1) = f(A_0) \cup f(A_1)$.
(g) $f(A_0 \cap A_1) \subset f(A_0) \cap f(A_1)$; show that equality holds if $f$ is injective.
(h) $f(A_0 - A_1) \supset f(A_0) - f(A_1)$; show that equality holds if $f$ is injective.

3. Show that (b), (c), (f), and (g) of Exercise 2 hold for arbitrary unions and inter- 
sections. 

4. Let $f : A \to B$ and $g : B \to C$.
(a) If $C_0 \subset C$, show that $(g \circ f)^{-1}(C_0) = f^{-1}(g^{-1}(C_0))$.
(b) If $f$ and $g$ are injective, show that $g \circ f$ is injective.
(c) If $g \circ f$ is injective, what can you say about injectivity of $f$ and $g$?
(d) If $f$ and $g$ are surjective, show that $g \circ f$ is surjective.
(e) If $g \circ f$ is surjective, what can you say about surjectivity of $f$ and $g$?
(f) Summarize your answers to (b)-(e) in the form of a theorem.

5. In general, let us denote the identity function for a set $C$ by $i_C$. That is, define
$i_C : C \to C$ to be the function given by the rule $i_C(x) = x$ for all $x \in C$.
Given $f : A \to B$, we say that a function $g : B \to A$ is a left inverse for $f$ if
$g \circ f = i_A$; and we say that $h : B \to A$ is a right inverse for $f$ if $f \circ h = i_B$.
(a) Show that if $f$ has a left inverse, $f$ is injective; and if $f$ has a right inverse,
$f$ is surjective.
(b) Give an example of a function that has a left inverse but no right inverse.
(c) Give an example of a function that has a right inverse but no left inverse.
(d) Can a function have more than one left inverse? More than one right inverse?
(e) Show that if $f$ has both a left inverse $g$ and a right inverse $h$, then $f$ is
bijective and $g = h = f^{-1}$.

6. Let $f : \mathbb{R} \to \mathbb{R}$ be the function $f(x) = x^3 - x$. By restricting the domain and
range of $f$ appropriately, obtain from $f$ a bijective function $g$. Draw the graphs
of $g$ and $g^{-1}$. (There are several possible choices for $g$.)


§3 Relations 


A concept that is, in some ways, more general than that of function is the concept of 
a relation. In this section, we define what mathematicians mean by a relation, and 
we consider two types of relations that occur with great frequency in mathematics: 
equivalence relations and order relations. Order relations will be used throughout the 
book; equivalence relations will not be used until §22. 


Definition. A relation on a set $A$ is a subset $C$ of the cartesian product $A \times A$.


If $C$ is a relation on $A$, we use the notation $xCy$ to mean the same thing as $(x, y) \in C$.
We read it “$x$ is in the relation $C$ to $y$.”

A rule of assignment $r$ for a function $f : A \to A$ is also a subset of $A \times A$. But it
is a subset of a very special kind: namely, one such that each element of $A$ appears as
the first coordinate of an element of $r$ exactly once. Any subset of $A \times A$ is a relation
on $A$.
EXAMPLE 1. Let $P$ denote the set of all people in the world, and define $D \subset P \times P$ by
the equation

$$D = \{(x, y) \mid x \text{ is a descendant of } y\}.$$

Then $D$ is a relation on the set $P$. The statements “$x$ is in the relation $D$ to $y$” and “$x$ is
a descendant of $y$” mean precisely the same thing, namely, that $(x, y) \in D$. Two other
relations on P are the following: 


B = {(x, y) | x has an ancestor who is also an ancestor of y}, 
S = {(x, y) | the parents of x are the parents of y}. 


We can call B the “blood relation” (pun intended), and we can call S the “sibling relation.” 
These three relations have quite different properties. The blood relationship is symmetric, 
for instance (if x is a blood relative of y, then y is a blood relative of x), whereas the 
descendant relation is not. We shall consider these relations again shortly. 


Equivalence Relations and Partitions 


An equivalence relation on a set A is a relation C on A having the following three 
properties: 
(1) (Reflexivity) xCx for every x in A. 
(2) (Symmetry) If xCy, then yCx. 
(3) (Transitivity) If xCy and yCz, then xCz. 

EXAMPLE 2. Among the relations defined in Example 1, the descendant relation $D$ is
neither reflexive nor symmetric, while the blood relation $B$ is not transitive (I am not a
blood relation to my wife, although my children are!). The sibling relation $S$ is, however,
an equivalence relation, as you may check.


There is no reason one must use a capital letter (or indeed a letter of any sort)
to denote a relation, even though it is a set. Another symbol will do just as well.
One symbol that is frequently used to denote an equivalence relation is the “tilde” 
symbol ~. Stated in this notation, the properties of an equivalence relation become 

(1) x ~ x for every x in A. 

(2) If x ~ y, then y ~ x. 

(3) If x ~ y and y ~ z, then x ~ z.

There are many other symbols that have been devised to stand for particular equiva- 
lence relations; we shall meet some of them in the pages of this book. 

Given an equivalence relation ~ on a set A and an element x of A, we define a 
certain subset E of A, called the equivalence class determined by x, by the equation 


$$E = \{y \mid y \sim x\}.$$


Note that the equivalence class E determined by x contains x, since x ~ x. Equiva- 
lence classes have the following property: 
Lemma 3.1. Two equivalence classes $E$ and $E'$ are either disjoint or equal.

Proof. Let $E$ be the equivalence class determined by $x$, and let $E'$ be the equivalence
class determined by $x'$. Suppose that $E \cap E'$ is not empty; let $y$ be a point of $E \cap E'$.
See Figure 3.1. We show that $E = E'$.

Figure 3.1

By definition, we have $y \sim x$ and $y \sim x'$. Symmetry allows us to conclude that
$x \sim y$ and $y \sim x'$; from transitivity it follows that $x \sim x'$. If now $w$ is any point of $E$,
we have $w \sim x$ by definition; it follows from another application of transitivity that
$w \sim x'$. We conclude that $E \subset E'$.

The symmetry of the situation allows us to conclude that $E' \subset E$ as well, so that
$E = E'$. ∎


Given an equivalence relation on a set $A$, let us denote by $\mathcal{E}$ the collection of all
the equivalence classes determined by this relation. The preceding lemma shows that
distinct elements of $\mathcal{E}$ are disjoint. Furthermore, the union of the elements of $\mathcal{E}$ equals
all of $A$ because every element of $A$ belongs to an equivalence class. The collection $\mathcal{E}$
is a particular example of what is called a partition of $A$:


Definition. A partition of a set A is a collection of disjoint nonempty subsets of A 
whose union is all of A. 


Studying equivalence relations on a set $A$ and studying partitions of $A$ are really
the same thing. Given any partition $\mathcal{D}$ of $A$, there is exactly one equivalence relation
on $A$ from which it is derived.

The proof is not difficult. To show that the partition $\mathcal{D}$ comes from some equivalence
relation, let us define a relation $C$ on $A$ by setting $xCy$ if $x$ and $y$ belong to
the same element of $\mathcal{D}$. Symmetry of $C$ is obvious; reflexivity follows from the fact
that the union of the elements of $\mathcal{D}$ equals all of $A$; transitivity follows from the fact
that distinct elements of $\mathcal{D}$ are disjoint. It is simple to check that the collection of
equivalence classes determined by $C$ is precisely the collection $\mathcal{D}$.

To show there is only one such equivalence relation, suppose that $C_1$ and $C_2$ are
two equivalence relations on $A$ that give rise to the same collection of equivalence
classes $\mathcal{D}$. Given $x \in A$, we show that $yC_1x$ if and only if $yC_2x$, from which we
conclude that $C_1 = C_2$. Let $E_1$ be the equivalence class determined by $x$ relative to
the relation $C_1$; let $E_2$ be the equivalence class determined by $x$ relative to the relation
$C_2$. Then $E_1$ is an element of $\mathcal{D}$, so that it must equal the unique element $D$ of $\mathcal{D}$ that
contains $x$. Similarly, $E_2$ must equal $D$. Now by definition, $E_1$ consists of all $y$ such
that $yC_1x$; and $E_2$ consists of all $y$ such that $yC_2x$. Since $E_1 = D = E_2$, our result is
proved.
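The passage from an equivalence relation to its partition, and back again, can be tried out on a finite set. The Python sketch below is an illustration only; the names `equivalence_classes`, `is_partition`, and `relation_from_partition` are our own, and the sample relation (two integers are related when they have the same square) is a one-dimensional stand-in chosen for this sketch.

```python
def equivalence_classes(A, related):
    """Collect the equivalence classes of A under an equivalence relation `related`."""
    classes = []
    for x in A:
        for cls in classes:
            representative = next(iter(cls))
            if related(x, representative):
                cls.add(x)
                break
        else:
            classes.append({x})   # x starts a new class
    return {frozenset(cls) for cls in classes}

def is_partition(A, collection):
    """Disjoint nonempty subsets of A whose union is all of A."""
    nonempty = all(len(block) > 0 for block in collection)
    covers = set().union(*collection) == set(A)
    disjoint = sum(len(block) for block in collection) == len(set(A))
    return nonempty and covers and disjoint

def relation_from_partition(collection):
    """The relation in which x is related to y when both lie in the same block."""
    return {(x, y) for block in collection for x in block for y in block}

A = range(-3, 4)

def same_square(x, y):
    return x * x == y * y

classes = equivalence_classes(A, same_square)
recovered = relation_from_partition(classes)
original = {(x, y) for x in A for y in A if same_square(x, y)}
```

The relation recovered from the partition agrees with the relation we started from, in line with the uniqueness argument above.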


EXAMPLE 3. Define two points in the plane to be equivalent if they lie at the same
distance from the origin. Reflexivity, symmetry, and transitivity hold trivially. The collection
$\mathcal{E}$ of equivalence classes consists of all circles centered at the origin, along with the set
consisting of the origin alone.


EXAMPLE 4. Define two points of the plane to be equivalent if they have the same
$y$-coordinate. The collection of equivalence classes is the collection of all straight lines in
the plane parallel to the $x$-axis.


EXAMPLE 5. Let $\mathcal{L}$ be the collection of all straight lines in the plane parallel to the line
$y = -x$. Then $\mathcal{L}$ is a partition of the plane, since each point lies on exactly one such line.
The partition $\mathcal{L}$ comes from the equivalence relation on the plane that declares the points
$(x_0, y_0)$ and $(x_1, y_1)$ to be equivalent if $x_0 + y_0 = x_1 + y_1$.


EXAMPLE 6. Let $\mathcal{L}'$ be the collection of all straight lines in the plane. Then $\mathcal{L}'$ is not
a partition of the plane, for distinct elements of $\mathcal{L}'$ are not necessarily disjoint; two lines
may intersect without being equal.


Order Relations 


A relation C on a set A is called an order relation (or a simple order, or a linear order) 
if it has the following properties: 

(1) (Comparability) For every $x$ and $y$ in $A$ for which $x \neq y$, either $xCy$ or $yCx$.
(2) (Nonreflexivity) For no $x$ in $A$ does the relation $xCx$ hold.
(3) (Transitivity) If $xCy$ and $yCz$, then $xCz$.

Note that property (1) does not by itself exclude the possibility that for some pair of 
elements x and y of A, both the relations xCy and yCx hold (since “or” means “one 
or the other, or both”). But properties (2) and (3) combined do exclude this possibil- 
ity; for if both xCy and yCx held, transitivity would imply that xCx, contradicting 
nonreflexivity. 


EXAMPLE 7. Consider the relation on the real line consisting of all pairs (x, y) of real 
numbers such that x < y. It is an order relation, called the “usual order relation,” on the 
real line. A less familiar order relation on the real line is the following: Define $xCy$ if
$x^2 < y^2$, or if $x^2 = y^2$ and $x < y$. You can check that this is an order relation.
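On a finite sample of real numbers, the three defining properties can be verified by brute force. The Python sketch below is only a sanity check on a sample, not a proof; the names `is_order_relation` and `C` are ours, and the integers from $-3$ to $3$ are an arbitrary choice of sample.

```python
from itertools import product

def is_order_relation(A, less):
    """Check comparability, nonreflexivity, and transitivity of `less` on a finite set A."""
    A = list(A)
    comparability = all(
        less(x, y) or less(y, x) for x, y in product(A, A) if x != y
    )
    nonreflexivity = not any(less(x, x) for x in A)
    transitivity = all(
        less(x, z)
        for x, y, z in product(A, A, A)
        if less(x, y) and less(y, z)
    )
    return comparability and nonreflexivity and transitivity

def C(x, y):
    """The less familiar order: compare squares first, then the numbers themselves."""
    return x * x < y * y or (x * x == y * y and x < y)

sample = range(-3, 4)
passes = is_order_relation(sample, C)

# Listing the sample from smallest to largest in the order C.
in_order = sorted(sample, key=lambda x: (x * x, x))
```

In this order the sample reads $0, -1, 1, -2, 2, -3, 3$. By contrast, the relation "less than or equal to" fails the check, since it is reflexive.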


EXAMPLE 8. Consider again the relationships among people given in Example 1. The
blood relation $B$ satisfies none of the properties of an order relation, and the sibling relation
$S$ satisfies only (3). The descendant relation $D$ does somewhat better, for it satisfies
both (2) and (3); however, comparability still fails. Relations that satisfy (2) and (3) occur
often enough in mathematics to be given a special name. They are called strict partial
order relations; we shall consider them later (see §11).
As the tilde, $\sim$, is the generic symbol for an equivalence relation, the “less than”
symbol, $<$, is commonly used to denote an order relation. Stated in this notation, the
properties of an order relation become
(1) If $x \neq y$, then either $x < y$ or $y < x$.
(2) If $x < y$, then $x \neq y$.
(3) If $x < y$ and $y < z$, then $x < z$.

We shall use the notation $x \leq y$ to stand for the statement “either $x < y$ or $x = y$”;
and we shall use the notation $y > x$ to stand for the statement “$x < y$.” We write
$x < y < z$ to mean “$x < y$ and $y < z$.”


Definition. If $X$ is a set and $<$ is an order relation on $X$, and if $a < b$, we use the
notation $(a, b)$ to denote the set

$$\{x \mid a < x < b\};$$

it is called an open interval in $X$. If this set is empty, we call $a$ the immediate predecessor
of $b$, and we call $b$ the immediate successor of $a$.


Definition. Suppose that $A$ and $B$ are two sets with order relations $<_A$ and $<_B$
respectively. We say that $A$ and $B$ have the same order type if there is a bijective
correspondence between them that preserves order, that is, if there exists a bijective
function $f : A \to B$ such that

$$a_1 <_A a_2 \Longrightarrow f(a_1) <_B f(a_2).$$


EXAMPLE 9. The interval $(-1, 1)$ of real numbers has the same order type as the set $\mathbb{R}$
of real numbers itself, for the function $f : (-1, 1) \to \mathbb{R}$ given by

$$f(x) = \frac{x}{1 - x^2}$$

is an order-preserving bijective correspondence, as you can check. It is pictured in
Figure 3.2.


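EXAMPLE 10. The subset $A = \{0\} \cup (1, 2)$ of $\mathbb{R}$ has the same order type as the subset

$$[0, 1) = \{x \mid 0 \leq x < 1\}$$

of $\mathbb{R}$. The function $f : A \to [0, 1)$ defined by

$$\begin{aligned}
f(0) &= 0, \\
f(x) &= x - 1 \quad \text{for } x \in (1, 2)
\end{aligned}$$

is the required order-preserving correspondence.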


Figure 3.2


One interesting way of defining an order relation, which will be useful to us later
in dealing with some examples, is the following:


Definition. Suppose that $A$ and $B$ are two sets with order relations $<_A$ and $<_B$
respectively. Define an order relation $<$ on $A \times B$ by defining

$$a_1 \times b_1 < a_2 \times b_2$$

if $a_1 <_A a_2$, or if $a_1 = a_2$ and $b_1 <_B b_2$. It is called the dictionary order relation on
$A \times B$.


Checking that this is an order relation involves looking at several separate cases; 
we leave it to you. 

The reason for the choice of terminology is fairly evident. The rule defining < is 
the same as the rule used to order the words in the dictionary. Given two words, one 
compares their first letters and orders the words according to the order in which their 
first letters appear in the alphabet. If the first letters are the same, one compares their 
second letters and orders accordingly. And so on. 
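The rule for the dictionary order is short enough to write out directly. The Python sketch below is an illustration only; `dictionary_less` is a hypothetical helper, pairs are written as Python tuples, and the default orders on both factors are the usual order on numbers. Python's built-in comparison of tuples happens to follow the same rule, which gives a convenient cross-check.

```python
import operator
from itertools import product

def dictionary_less(p, q, less_A=operator.lt, less_B=operator.lt):
    """Dictionary order on pairs: compare first coordinates, then second coordinates."""
    a1, b1 = p
    a2, b2 = q
    return less_A(a1, a2) or (a1 == a2 and less_B(b1, b2))

points = list(product(range(3), range(3)))

# Cross-check against Python's built-in (lexicographic) comparison of tuples.
agrees_with_tuples = all(
    dictionary_less(p, q) == (p < q) for p in points for q in points
)

first_coordinate_decides = dictionary_less((1, 5), (2, 0))
second_coordinate_breaks_ties = dictionary_less((1, 5), (1, 7))
not_less = dictionary_less((2, 0), (1, 9))
```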

EXAMPLE 11. Consider the dictionary order on the plane $\mathbb{R} \times \mathbb{R}$. In this order, the
point $p$ is less than every point lying above it on the vertical line through $p$, and $p$ is less
than every point to the right of this vertical line.


EXAMPLE 12. Consider the set $[0, 1)$ of real numbers and the set $\mathbb{Z}_+$ of positive integers,
both in their usual orders; give $\mathbb{Z}_+ \times [0, 1)$ the dictionary order. This set has the same order
type as the set of nonnegative reals; the function

$$f(n \times t) = n + t - 1$$

is the required bijective order-preserving correspondence. On the other hand, the set
$[0, 1) \times \mathbb{Z}_+$ in the dictionary order has quite a different order type; for example, every
element of this ordered set has an immediate successor. These sets are pictured in
Figure 3.3.

Figure 3.3


One of the properties of the real numbers that you may have seen before is the 
“least upper bound property.” One can define this property for an arbitrary ordered set. 
First, we need some preliminary definitions. 

Suppose that $A$ is a set ordered by the relation $<$. Let $A_0$ be a subset of $A$. We
say that the element $b$ is the largest element of $A_0$ if $b \in A_0$ and if $x \leq b$ for every
$x \in A_0$. Similarly, we say that $a$ is the smallest element of $A_0$ if $a \in A_0$ and if $a \leq x$
for every $x \in A_0$. It is easy to see that a set has at most one largest element and at
most one smallest element.

We say that the subset $A_0$ of $A$ is bounded above if there is an element $b$ of $A$ such
that $x \leq b$ for every $x \in A_0$; the element $b$ is called an upper bound for $A_0$. If the
set of all upper bounds for $A_0$ has a smallest element, that element is called the least
upper bound, or the supremum, of $A_0$. It is denoted by $\sup A_0$; it may or may not
belong to $A_0$. If it does, it is the largest element of $A_0$.

Similarly, $A_0$ is bounded below if there is an element $a$ of $A$ such that $a \leq x$ for
every $x \in A_0$; the element $a$ is called a lower bound for $A_0$. If the set of all lower
bounds for $A_0$ has a largest element, that element is called the greatest lower bound,
or the infimum, of $A_0$. It is denoted by $\inf A_0$; it may or may not belong to $A_0$. If it
does, it is the smallest element of $A_0$.

Now we can define the least upper bound property. 


Definition. An ordered set $A$ is said to have the least upper bound property if every
nonempty subset $A_0$ of $A$ that is bounded above has a least upper bound. Analogously,
the set $A$ is said to have the greatest lower bound property if every nonempty subset
$A_0$ of $A$ that is bounded below has a greatest lower bound.

We leave it to the exercises to show that A has the least upper bound property if 
and only if it has the greatest lower bound property. 


EXAMPLE 13. Consider the set A = (−1, 1) of real numbers in the usual order. Assuming
the fact that the real numbers have the least upper bound property, it follows that
the set A has the least upper bound property. For, given any subset of A having an upper
bound in A, it follows that its least upper bound (in the real numbers) must be in A. For
example, the subset {−1/2n | n ∈ Z₊} of A, though it has no largest element, does have a
least upper bound in A, the number 0.

On the other hand, the set B = (−1, 0) ∪ (0, 1) does not have the least upper bound
property. The subset {−1/2n | n ∈ Z₊} of B is bounded above by any element of (0, 1),
but it has no least upper bound in B.
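The behavior of the set {−1/2n | n ∈ Z₊} can be checked numerically. The following Python sketch is an illustration only: the helper names `elements` and `exceeds` are ours, and a finite sample proves nothing about a supremum. It uses exact rational arithmetic to confirm, on a sample, that every element is negative, that the elements increase (so none is largest), and that any negative candidate bound is eventually exceeded.

```python
from fractions import Fraction

def elements(count):
    """First `count` elements -1/(2n), n = 1, 2, ..., as exact rationals."""
    return [Fraction(-1, 2 * n) for n in range(1, count + 1)]

sample = elements(1000)

# Every sampled element is negative, so 0 is an upper bound for the sample.
all_negative = all(x < 0 for x in sample)

# The sampled elements strictly increase, so none of them is the largest.
strictly_increasing = all(a < b for a, b in zip(sample, sample[1:]))

def exceeds(c):
    """For a negative rational c, return the first n with -1/(2n) > c.

    This shows that c is not an upper bound for the set.
    """
    if c >= 0:
        raise ValueError("c must be negative")
    n = 1
    while Fraction(-1, 2 * n) <= c:
        n += 1
    return n
```

For instance, `exceeds(Fraction(-1, 100))` finds an element above −1/100, so no negative number bounds the set from above. This is consistent with 0 being the least upper bound in A, and with the absence of any least upper bound in B, where 0 has been removed.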


Exercises 


Equivalence Relations 


1. Define two points (x₀, y₀) and (x₁, y₁) of the plane to be equivalent if
y₀ − x₀² = y₁ − x₁². Check that this is an equivalence relation and describe the
equivalence classes.

2. Let C be a relation on a set A. If A₀ ⊂ A, define the restriction of C to A₀ to be
the relation C ∩ (A₀ × A₀). Show that the restriction of an equivalence relation
is an equivalence relation.

3. Here is a “proof” that every relation C that is both symmetric and transitive is
also reflexive: “Since C is symmetric, aCb implies bCa. Since C is transitive,
aCb and bCa together imply aCa, as desired.” Find the flaw in this argument.

4. Let f : A → B be a surjective function. Let us define a relation on A by setting

a₀ ~ a₁ if f(a₀) = f(a₁).

(a) Show that this is an equivalence relation.
(b) Let A* be the set of equivalence classes. Show there is a bijective
correspondence of A* with B.

5. Let S and S′ be the following subsets of the plane:

S = {(x, y) | y = x + 1 and 0 < x < 2},
S′ = {(x, y) | y − x is an integer}.

(a) Show that S′ is an equivalence relation on the real line and S′ ⊃ S. Describe
the equivalence classes of S′.
(b) Show that given any collection of equivalence relations on a set A, their
intersection is an equivalence relation on A.
(c) Describe the equivalence relation T on the real line that is the intersection
of all equivalence relations on the real line that contain S. Describe the
equivalence classes of T.
Order Relations


6. Define a relation on the plane by setting

(x₀, y₀) < (x₁, y₁)

if either y₀ − x₀² < y₁ − x₁², or y₀ − x₀² = y₁ − x₁² and x₀ < x₁. Show that this
is an order relation on the plane, and describe it geometrically.

7. Show that the restriction of an order relation is an order relation.

8. Check that the relation defined in Example 7 is an order relation.

9. Check that the dictionary order is an order relation.

10. (a) Show that the map f : (−1, 1) → R of Example 9 is order preserving.
(b) Show that the equation g(y) = 2y/[1 + (1 + 4y²)^(1/2)] defines a function
g : R → (−1, 1) that is both a left and a right inverse for f.

11. Show that an element in an ordered set has at most one immediate successor and
at most one immediate predecessor. Show that a subset of an ordered set has at
most one smallest element and at most one largest element.

12. Let Z₊ denote the set of positive integers. Consider the following order relations
on Z₊ × Z₊:
(i) The dictionary order.
(ii) (x₀, y₀) < (x₁, y₁) if either x₀ − y₀ < x₁ − y₁, or x₀ − y₀ = x₁ − y₁ and
y₀ < y₁.
(iii) (x₀, y₀) < (x₁, y₁) if either x₀ + y₀ < x₁ + y₁, or x₀ + y₀ = x₁ + y₁ and
y₀ < y₁.
In these order relations, which elements have immediate predecessors? Does the
set have a smallest element? Show that all three order types are different.

13. Prove the following:

Theorem. If an ordered set A has the least upper bound property, then it has the
greatest lower bound property.

14. If C is a relation on a set A, define a new relation D on A by letting (b, a) ∈ D
if (a, b) ∈ C.
(a) Show that C is symmetric if and only if C = D.
(b) Show that if C is an order relation, D is also an order relation.
(c) Prove the converse of the theorem in Exercise 13.

15. Assume that the real line has the least upper bound property.
(a) Show that the sets

[0, 1] = {x | 0 ≤ x ≤ 1},
[0, 1) = {x | 0 ≤ x < 1}

have the least upper bound property.
(b) Does [0, 1] × [0, 1] in the dictionary order have the least upper bound
property? What about [0, 1] × [0, 1)? What about [0, 1) × [0, 1]?
§4 The Integers and the Real Numbers


Up to now we have been discussing what might be called the logical foundations for
our study of topology, namely the elementary concepts of set theory. Now we turn to what
we might call the mathematical foundations for our study, namely the integers and the real
number system. We have already used them in an informal way in the examples and
exercises of the preceding sections. Now we wish to deal with them more formally.

One way of establishing these foundations is to construct the real number system, 
using only the axioms of set theory—to build them with one’s bare hands, so to speak. 
This way of approaching the subject takes a good deal of time and effort and is of 
greater logical than mathematical interest. 

A second way is simply to assume a set of axioms for the real numbers and work 
from these axioms. In the present section, we shall sketch this approach to the real 
numbers. Specifically, we shall give a set of axioms for the real numbers and shall 
indicate how the familiar properties of real numbers and the integers are derived from 
them. But we shall leave most of the proofs to the exercises. If you have seen all 
this before, our description should refresh your memory. If not, you may want to 
work through the exercises in detail in order to make sure of your knowledge of the 
mathematical foundations. 

First we need a definition from set theory. 


Definition. A binary operation on a set A is a function f mapping A × A into A.


When dealing with a binary operation f on a set A, we usually use a notation
different from the standard functional notation introduced in §2. Instead of denoting
the value of the function f at the point (a, a′) by f(a, a′), we usually write the symbol
for the function between the two coordinates of the point in question, writing the value
of the function at (a, a′) as a f a′. Furthermore (just as was the case with relations),
it is more common to use some symbol other than a letter to denote an operation.
Symbols often used are the plus symbol +, the multiplication symbols · and ∘, and the
asterisk *; however, there are many others.


Assumption 


We assume there exists a set R, called the set of real numbers, two binary operations +
and · on R, called the addition and multiplication operations, respectively, and an order
relation < on R, such that the following properties hold:


Algebraic Properties
(1) (x + y) + z = x + (y + z),
    (x · y) · z = x · (y · z) for all x, y, z in R.
(2) x + y = y + x,
    x · y = y · x for all x, y in R.
(3) There exists a unique element of R called zero, denoted by 0, such that x + 0 = x
    for all x ∈ R.
    There exists a unique element of R called one, different from 0 and denoted by 1,
    such that x · 1 = x for all x ∈ R.
(4) For each x in R, there exists a unique y in R such that x + y = 0.
    For each x in R different from 0, there exists a unique y in R such that x · y = 1.
(5) x · (y + z) = (x · y) + (x · z) for all x, y, z ∈ R.


A Mixed Algebraic and Order Property
(6) If x > y, then x + z > y + z.
    If x > y and z > 0, then x · z > y · z.


Order Properties
(7) The order relation < has the least upper bound property.
(8) If x < y, there exists an element z such that x < z and z < y.

From properties (1)-(5) follow the familiar “laws of algebra.” Given x, one denotes
by −x that number y such that x + y = 0; it is called the negative of x. One
defines the subtraction operation by the formula z − x = z + (−x). Similarly, given
x ≠ 0, one denotes by 1/x that number y such that x · y = 1; it is called the reciprocal
of x. One defines the quotient z/x by the formula z/x = z · (1/x). The usual laws of
signs, and the rules for adding and multiplying fractions, follow as theorems. These
laws of algebra are listed in Exercise 1 at the end of the section. We often denote x · y
simply by xy.

When one adjoins property (6) to properties (1)-(5), one can prove the usual “laws
of inequalities,” such as the following:


If x > y and z < 0, then x · z < y · z.
−1 < 0 and 0 < 1.


The laws of inequalities are listed in Exercise 2.

We define a number x to be positive if x > 0, and to be negative if x < 0. We
denote the positive reals by R₊ and the nonnegative reals (for reasons to be explained
later) by R̄₊. Properties (1)-(6) are familiar properties in modern algebra. Any set
with two binary operations satisfying (1)-(5) is called by algebraists a field; if the field
has an order relation satisfying (6), it is called an ordered field.

Properties (7) and (8), on the other hand, are familiar properties in topology. They 
involve only the order relation; any set with an order relation satisfying (7) and (8) is 
called by topologists a linear continuum. 

Now it happens that when one adjoins to the axioms for an ordered field
[properties (1)-(6)] the axioms for a linear continuum [properties (7) and (8)], the resulting
list contains some redundancies. Property (8), in particular, can be proved as a
consequence of the others; given x < y one can show that z = (x + y)/(1 + 1) satisfies
the requirements of (8). Therefore, in the standard treatment of the real numbers,
properties (1)-(7) are taken as axioms, and property (8) becomes a theorem. We have
included (8) in our list merely to emphasize the fact that it and the least upper bound
property are the two crucial properties of the order relation for R. From these two
properties many of the topological properties of R may be derived, as we shall see in
Chapter 3.
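The formula z = (x + y)/(1 + 1) uses only the field operations and the order, so it can be tried out in any ordered field. As a small sketch, we use Python's `Fraction` type as a stand-in for the rational numbers (an ordered field that does not have the least upper bound property); the function name `between` is ours, not the text's.

```python
from fractions import Fraction

def between(x, y):
    """Return z = (x + y)/(1 + 1); when x < y this z satisfies x < z < y."""
    if not x < y:
        raise ValueError("requires x < y")
    return (x + y) / (1 + 1)

x = Fraction(1, 3)
y = Fraction(1, 2)
z = between(x, y)
```

Here z works out to 5/12, which lies strictly between 1/3 and 1/2. The check is an instance of property (8), not a proof of it; the proof is Exercise 2(k) of this section.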

Now there is nothing in this list as it stands to tell us what an integer is. We now
define the integers, using only properties (1)-(6).


Definition. A subset A of the real numbers is said to be inductive if it contains the
number 1, and if for every x in A, the number x + 1 is also in A. Let 𝒜 be the collection
of all inductive subsets of R. Then the set Z₊ of positive integers is defined by the
equation

Z₊ = ⋂_{A ∈ 𝒜} A.


Note that the set R₊ of positive real numbers is inductive, for it contains 1 and
the statement x > 0 implies the statement x + 1 > 0. Therefore, Z₊ ⊂ R₊, so the
elements of Z₊ are indeed positive, as the choice of terminology suggests. Indeed, one
sees readily that 1 is the smallest element of Z₊, because the set of all real numbers x
for which x ≥ 1 is inductive.

The basic properties of Z₊, which follow readily from the definition, are the
following:

(1) Z₊ is inductive.
(2) (Principle of induction). If A is an inductive set of positive integers, then A = Z₊.

We define the set Z of integers to be the set consisting of the positive integers Z₊,
the number 0, and the negatives of the elements of Z₊. One proves that the sum,
difference, and product of two integers are integers, but the quotient is not necessarily
an integer. The set Q of quotients of integers is called the set of rational numbers.

One proves also that, given the integer n, there is no integer a such that n < a <
n + 1.

If n is a positive integer, we use the symbol Sₙ to denote the set of all positive
integers less than n; we call it a section of the positive integers. The set S₁ is empty,
and Sₙ₊₁ denotes the set of positive integers between 1 and n, inclusive. We also use
the notation

{1, ..., n} = Sₙ₊₁

for the latter set.

Now we prove two properties of the positive integers that may not be quite so 
familiar, but are quite useful. They may be thought of as alternative versions of the 
induction principle. 


Theorem 4.1 (Well-ordering property). Every nonempty subset of Z₊ has a
smallest element.
Proof. We first prove that, for each n ∈ Z₊, the following statement holds: Every
nonempty subset of {1, ..., n} has a smallest element.

Let A be the set of all positive integers n for which this statement holds. Then A
contains 1, since if n = 1, the only nonempty subset of {1, ..., n} is the set {1} itself.
Then, supposing A contains n, we show that it contains n + 1. So let C be a nonempty
subset of the set {1, ..., n + 1}. If C consists of the single element n + 1, then that
element is the smallest element of C. Otherwise, consider the set C ∩ {1, ..., n}, which
is nonempty. Because n ∈ A, this set has a smallest element, which will automatically
be the smallest element of C also. Thus A is inductive, so we conclude that A = Z₊;
hence the statement is true for all n ∈ Z₊.

Now we prove the theorem. Suppose that D is a nonempty subset of Z₊. Choose
an element n of D. Then the set A = D ∩ {1, ..., n} is nonempty, so that A has a
smallest element k. The element k is automatically the smallest element of D as well. ∎
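The two steps of this proof translate directly into a recursive procedure. The Python sketch below is only an illustration of the argument, under assumptions of ours: the function names are hypothetical, a subset of {1, ..., n} is modeled as a Python `set`, and the subset D of Z₊ is given by a membership test together with one known element.

```python
def smallest_in_initial_segment(c, n):
    """Smallest element of a nonempty subset c of {1, ..., n}.

    Mirrors the induction on n: either c is the single element n,
    or the smallest element is found in c intersected with {1, ..., n - 1}.
    """
    if not c or not all(1 <= k <= n for k in c):
        raise ValueError("c must be a nonempty subset of {1, ..., n}")
    if c == {n}:
        return n
    # c is not just {n}, so removing n leaves a nonempty subset of {1, ..., n - 1}.
    return smallest_in_initial_segment(c - {n}, n - 1)

def smallest_element(is_member, n):
    """Smallest element of D = {k in Z+ : is_member(k)}, given an element n of D."""
    if not is_member(n):
        raise ValueError("n must belong to D")
    a = {k for k in range(1, n + 1) if is_member(k)}  # A = D ∩ {1, ..., n}
    return smallest_in_initial_segment(a, n)

# Example: D is the set of positive multiples of 7, and we happen to know 35 is in D.
k = smallest_element(lambda m: m % 7 == 0, 35)
```

The recursion depth grows with n, so the sketch is suitable only for small examples; it is meant to follow the proof, not to be an efficient way to compute a minimum.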


Theorem 4.2 (Strong induction principle). Let A be a set of positive integers.
Suppose that for each positive integer n, the statement Sₙ ⊂ A implies the statement
n ∈ A. Then A = Z₊.


Proof. If A does not equal all of Z₊, let n be the smallest positive integer that is not
in A. Then every positive integer less than n is in A, so that Sₙ ⊂ A. Our hypothesis
implies that n ∈ A, contrary to assumption. ∎


Everything we have done up to now has used only the axioms for an ordered field,
properties (1)-(6) of the real numbers. At what point do you need (7), the least upper
bound axiom?

For one thing, you need the least upper bound axiom to prove that the set Z₊ of
positive integers has no upper bound in R. This is the Archimedean ordering property
of the real line. To prove it, we assume that Z₊ has an upper bound and derive a
contradiction. If Z₊ has an upper bound, it has a least upper bound b. There exists
n ∈ Z₊ such that n > b − 1; for otherwise, b − 1 would be an upper bound for Z₊
smaller than b. Then n + 1 > b, contrary to the fact that b is an upper bound for Z₊.

The least upper bound axiom is also used to prove a number of other things
about R. It is used for instance to show that R has the greatest lower bound
property. It is also used to prove the existence of a unique positive square root √x for
every positive real number. This fact, in turn, can be used to demonstrate the existence
of real numbers that are not rational numbers; the number √2 is an easy example.

We use the symbol 2 to denote 1 + 1, the symbol 3 to denote 2 + 1, and so on
through the standard symbols for the positive integers. It is a fact that this procedure
assigns to each positive integer a unique symbol, but we never need this fact and shall
not prove it.

Proofs of these properties of the integers and real numbers, along with a few other 
properties we shall need later, are outlined in the exercises that follow. 
Exercises


1. Prove the following “laws of algebra” for R, using only axioms (1)-(5):
(a) If x + y = x, then y = 0.
(b) 0 · x = 0. [Hint: Compute (x + 0) · x.]
(c) −0 = 0.
(d) −(−x) = x.
(e) x(−y) = −(xy) = (−x)y.
(f) (−1)x = −x.
(g) x(y − z) = xy − xz.
(h) −(x + y) = −x − y; −(x − y) = −x + y.
(i) If x ≠ 0 and x · y = x, then y = 1.
(j) x/x = 1 if x ≠ 0.
(k) x/1 = x.
(l) x ≠ 0 and y ≠ 0 ⇒ xy ≠ 0.
(m) (1/y)(1/z) = 1/(yz) if y, z ≠ 0.
(n) (x/y)(w/z) = (xw)/(yz) if y, z ≠ 0.
(o) (x/y) + (w/z) = (xz + wy)/(yz) if y, z ≠ 0.
(p) x ≠ 0 ⇒ 1/x ≠ 0.
(q) 1/(w/z) = z/w if w, z ≠ 0.
(r) (x/y)/(w/z) = (xz)/(yw) if y, w, z ≠ 0.
(s) (ax)/y = a(x/y) if y ≠ 0.
(t) (−x)/y = x/(−y) = −(x/y) if y ≠ 0.
2. Prove the following “laws of inequalities” for R, using axioms (1)-(6) along with
the results of Exercise 1:
(a) x > y and w > z ⇒ x + w > y + z.
(b) x > 0 and y > 0 ⇒ x + y > 0 and x · y > 0.
(c) x > 0 ⇔ −x < 0.
(d) x > y ⇔ −x < −y.
(e) x > y and z < 0 ⇒ xz < yz.
(f) x ≠ 0 ⇒ x² > 0, where x² = x · x.
(g) −1 < 0 < 1.
(h) xy > 0 ⇔ x and y are both positive or both negative.
(i) x > 0 ⇒ 1/x > 0.
(j) x > y > 0 ⇒ 1/x < 1/y.
(k) x < y ⇒ x < (x + y)/2 < y.
3. (a) Show that if 𝒜 is a collection of inductive sets, then the intersection of the
elements of 𝒜 is an inductive set.
(b) Prove the basic properties (1) and (2) of Z₊.
4. (a) Prove by induction that given n ∈ Z₊, every nonempty subset of {1, ..., n}
has a largest element.
(b) Explain why you cannot conclude from (a) that every nonempty subset of Z₊
has a largest element.
5. Prove the following properties of Z and Z₊:
(a) a, b ∈ Z₊ ⇒ a + b ∈ Z₊. [Hint: Show that given a ∈ Z₊, the set
X = {x | x ∈ R and a + x ∈ Z₊} is inductive.]
(b) a, b ∈ Z₊ ⇒ a · b ∈ Z₊.
(c) Show that a ∈ Z₊ ⇒ a − 1 ∈ Z₊ ∪ {0}. [Hint: Let X = {x | x ∈ R and
x − 1 ∈ Z₊ ∪ {0}}; show that X is inductive.]
(d) c, d ∈ Z ⇒ c + d ∈ Z and c − d ∈ Z. [Hint: Prove it first for d = 1.]
(e) c, d ∈ Z ⇒ c · d ∈ Z.

6. Let a ∈ R. Define inductively

a^1 = a,
a^(n+1) = a^n · a

for n ∈ Z₊. (See §7 for a discussion of the process of inductive definition.)
Show that for n, m ∈ Z₊ and a, b ∈ R,

a^n a^m = a^(n+m),
(a^n)^m = a^(nm),
a^m b^m = (ab)^m.

These are called the laws of exponents. [Hint: For fixed n, prove the formulas
by induction on m.]

7. Let a ∈ R and a ≠ 0. Define a^0 = 1, and for n ∈ Z₊, a^(−n) = 1/a^n. Show that
the laws of exponents hold for a, b ≠ 0 and n, m ∈ Z.

8. (a) Show that R has the greatest lower bound property.
(b) Show that inf{1/n | n ∈ Z₊} = 0.
(c) Show that given a with 0 < a < 1, inf{a^n | n ∈ Z₊} = 0. [Hint: Let
h = (1 − a)/a, and show that (1 + h)^n ≥ 1 + nh.]

9. (a) Show that every nonempty subset of Z that is bounded above has a largest
element.
(b) If x ∉ Z, show there is exactly one n ∈ Z such that n < x < n + 1.
(c) If x − y > 1, show there is at least one n ∈ Z such that y < n < x.
(d) If y < x, show there is a rational number z such that y < z < x.

10. Show that every positive number a has exactly one positive square root, as
follows:
(a) Show that if x > 0 and 0 ≤ h < 1, then

(x + h)² ≤ x² + h(2x + 1),
(x − h)² ≥ x² − h(2x).

(b) Let x > 0. Show that if x² < a, then (x + h)² < a for some h > 0; and if
x² > a, then (x − h)² > a for some h > 0.
(c) Given a > 0, let B be the set of all real numbers x such that x² < a.
Show that B is bounded above and contains at least one positive number.
Let b = sup B; show that b² = a.
(d) Show that if b and c are positive and b² = c², then b = c.

11. Given m ∈ Z, we say that m is even if m/2 ∈ Z, and m is odd otherwise.
(a) Show that if m is odd, m = 2n + 1 for some n ∈ Z. [Hint: Choose n so that
n < m/2 < n + 1.]
(b) Show that if p and q are odd, so are p · q and p^n, for any n ∈ Z₊.
(c) Show that if a > 0 is rational, then a = m/n for some m, n ∈ Z₊ where
not both m and n are even. [Hint: Let n be the smallest element of the set
{x | x ∈ Z₊ and x · a ∈ Z₊}.]
(d) Theorem. √2 is irrational.


§5 Cartesian Products 


We have already defined what we mean by the cartesian product A x B of two sets. 
Now we introduce more general cartesian products. 


Definition. Let 𝒜 be a nonempty collection of sets. An indexing function for 𝒜 is
a surjective function f from some set J, called the index set, to 𝒜. The collection 𝒜,
together with the indexing function f, is called an indexed family of sets. Given
α ∈ J, we shall denote the set f(α) by the symbol A_α. And we shall denote the
indexed family itself by the symbol


{A_α}_{α ∈ J},


which is read “the family of all A_α, as α ranges over J.” Sometimes we write merely
{A_α}, if it is clear what the index set is.


Note that although an indexing function is required to be surjective, it is not
required to be injective. It is entirely possible for A_α and A_β to be the same set of 𝒜,
even though α ≠ β.

One way in which indexing functions are used is to give a new notation for
arbitrary unions and intersections of sets. Suppose that f : J → 𝒜 is an indexing function
for 𝒜; let A_α denote f(α). Then we define


⋃_{α ∈ J} A_α = {x | for at least one α ∈ J, x ∈ A_α},


and


⋂_{α ∈ J} A_α = {x | for every α ∈ J, x ∈ A_α}.
These are simply new notations for previously defined concepts; one sees at once
(using the surjectivity of the index function) that the first equals the union of all the
elements of 𝒜 and the second equals the intersection of all the elements of 𝒜.
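A small computational model may make the role of the indexing function concrete. In the Python sketch below, a dictionary plays the part of the indexing function f : J → 𝒜; the index set {"a", "b", "c"}, the particular sets, and the function names are hypothetical choices of ours. Two indices are deliberately sent to the same set, to show that the indexed union and intersection agree with those of the collection itself.

```python
from functools import reduce

# Indexing function f : J -> collection, stored as a dict alpha -> A_alpha.
family = {
    "a": frozenset({1, 2, 3}),
    "b": frozenset({2, 3, 4}),
    "c": frozenset({1, 2, 3}),  # same set as family["a"]: f need not be injective
}

def indexed_union(fam):
    """{x | for at least one alpha in J, x in A_alpha}."""
    return {x for alpha in fam for x in fam[alpha]}

def indexed_intersection(fam):
    """{x | for every alpha in J, x in A_alpha}; J is assumed nonempty."""
    return set(reduce(lambda s, t: s & t, fam.values()))

# The collection itself: the image of the indexing function (two distinct sets).
collection = set(family.values())
union_of_collection = set().union(*collection)
intersection_of_collection = set(reduce(lambda s, t: s & t, collection))
```

Because every element of the collection is hit by some index, the indexed union and intersection coincide with the union and intersection of the collection, even though the family has three indices and the collection has only two elements.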

Two especially useful index sets are the set {1, ..., n} of positive integers from 1
to n, and the set Z₊ of all positive integers. For these index sets, we introduce some
special notation. If a collection of sets is indexed by the set {1, ..., n}, we denote the
indexed family by the symbol {A₁, ..., Aₙ}, and we denote the union and intersection,
respectively, of the members of this family by the symbols


A₁ ∪ ⋯ ∪ Aₙ and A₁ ∩ ⋯ ∩ Aₙ.


In the case where the index set is the set Z₊, we denote the indexed family by the
symbol {A₁, A₂, ...}, and the union and intersection by the respective symbols


A₁ ∪ A₂ ∪ ⋯ and A₁ ∩ A₂ ∩ ⋯.


Definition. Let m be a positive integer. Given a set X, we define an m-tuple of
elements of X to be a function

x : {1, ..., m} → X.

If x is an m-tuple, we often denote the value of x at i by the symbol xᵢ rather than x(i)
and call it the ith coordinate of x. And we often denote the function x itself by the
symbol

(x₁, ..., xₘ).

Now let {A₁, ..., Aₘ} be a family of sets indexed with the set {1, ..., m}. Let X =
A₁ ∪ ⋯ ∪ Aₘ. We define the cartesian product of this indexed family, denoted by

∏_{i=1}^{m} Aᵢ or A₁ × ⋯ × Aₘ,

to be the set of all m-tuples (x₁, ..., xₘ) of elements of X such that xᵢ ∈ Aᵢ for each i.


EXAMPLE 1. We now have two definitions for the symbol A × B. One definition is,
of course, the one given earlier, under which A × B denotes the set of all ordered pairs
(a, b) such that a ∈ A and b ∈ B. The second definition, just given, defines A × B as
the set of all functions x : {1, 2} → A ∪ B such that x(1) ∈ A and x(2) ∈ B. There
is an obvious bijective correspondence between these two sets, under which the ordered
pair (a, b) corresponds to the function x defined by x(1) = a and x(2) = b. Since we
commonly denote this function x in “tuple notation” by the symbol (a, b), the notation
itself suggests the correspondence. Thus for the cartesian product of two sets, the general
definition of cartesian product reduces essentially to the earlier one.
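The correspondence of this example can be written out explicitly for small sets. In the following Python sketch (the sets A and B and all function names are hypothetical, chosen only for illustration), an ordered pair is a Python tuple, and a function x : {1, 2} → A ∪ B is modeled as a dictionary with keys 1 and 2.

```python
from itertools import product

A = {"p", "q"}
B = {0, 1, 2}

# First definition: the set of ordered pairs (a, b) with a in A and b in B.
pairs = set(product(A, B))

def pair_to_function(pair):
    """Send the ordered pair (a, b) to the function x with x(1) = a, x(2) = b."""
    a, b = pair
    return {1: a, 2: b}

def function_to_pair(x):
    """Send a function x on {1, 2} back to the ordered pair (x(1), x(2))."""
    return (x[1], x[2])

# Second definition: functions x on {1, 2} with x(1) in A and x(2) in B.
functions = [pair_to_function(p) for p in pairs]
round_trip = {function_to_pair(x) for x in functions}
```

Going from pairs to functions and back returns exactly the original set of pairs, which is what it means for the two maps to be inverse to each other on this small example.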


EXAMPLE 2. How does the cartesian product A × B × C differ from the cartesian products
A × (B × C) and (A × B) × C? Very little. There are obvious bijective correspondences
between these sets, indicated as follows:


(a, b, c) ⟷ (a, (b, c)) ⟷ ((a, b), c).
Definition. Given a set X, we define an ω-tuple of elements of X to be a function

x : Z₊ → X;

we also call such a function a sequence, or an infinite sequence, of elements of X. If
x is an ω-tuple, we often denote the value of x at i by xᵢ rather than x(i), and call it the
ith coordinate of x. We denote x itself by the symbol


(x₁, x₂, ...) or (xₙ)_{n ∈ Z₊}.


Now let {A₁, A₂, ...} be a family of sets, indexed with the positive integers; let X be
the union of the sets in this family. The cartesian product of this indexed family of
sets, denoted by


∏_{i ∈ Z₊} Aᵢ or A₁ × A₂ × ⋯,


is defined to be the set of all ω-tuples (x₁, x₂, ...) of elements of X such that xᵢ ∈ Aᵢ
for each i.


Nothing in these definitions requires the sets Aᵢ to be different from one another.
Indeed, they may all equal the same set X. In that case, the cartesian product
A₁ × ⋯ × Aₘ is just the set of all m-tuples of elements of X, which we denote by X^m.
Similarly, the product A₁ × A₂ × ⋯ is just the set of all ω-tuples of elements of X,
which we denote by X^ω.

Later we will define the cartesian product of an arbitrary indexed family of sets.

EXAMPLE 3. If R is the set of real numbers, then R^m denotes the set of all m-tuples of
real numbers; it is often called euclidean m-space (although Euclid would never recognize
it). Analogously, R^ω is sometimes called “infinite-dimensional euclidean space”; it is the
set of all ω-tuples (x₁, x₂, ...) of real numbers, that is, the set of all functions x : Z₊ → R.


Exercises 


1. Show there is a bijective correspondence of A × B with B × A.
2. (a) Show that if n > 1 there is bijective correspondence of

A₁ × ⋯ × Aₙ with (A₁ × ⋯ × Aₙ₋₁) × Aₙ.

(b) Given the indexed family {A₁, A₂, ...}, let Bᵢ = A₂ᵢ₋₁ × A₂ᵢ for each
positive integer i. Show there is bijective correspondence of A₁ × A₂ × ⋯
with B₁ × B₂ × ⋯.
3. Let A = A₁ × A₂ × ⋯ and B = B₁ × B₂ × ⋯.
(a) Show that if Bᵢ ⊂ Aᵢ for all i, then B ⊂ A. (Strictly speaking, if we are
given a function mapping the index set Z₊ into the union of the sets Bᵢ, we
must change its range before it can be considered as a function mapping Z₊
into the union of the sets Aᵢ. We shall ignore this technicality when dealing
with cartesian products.)
(b) Show the converse of (a) holds if B is nonempty.
(c) Show that if A is nonempty, each Aᵢ is nonempty. Does the converse hold?
(We will return to this question in the exercises of §19.)
(d) What is the relation between the set A ∪ B and the cartesian product of the
sets Aᵢ ∪ Bᵢ? What is the relation between the set A ∩ B and the cartesian
product of the sets Aᵢ ∩ Bᵢ?
4. Let m, n ∈ Z₊. Let X ≠ ∅.
(a) If m ≤ n, find an injective map f : X^m → X^n.
(b) Find a bijective map g : X^m × X^n → X^(m+n).
(c) Find an injective map h : X^n → X^ω.
(d) Find a bijective map k : X^n × X^ω → X^ω.
(e) Find a bijective map l : X^ω × X^ω → X^ω.
(f) If A ⊂ B, find an injective map m : (A^ω)^n → B^ω.
5. Which of the following subsets of R^ω can be expressed as the cartesian product
of subsets of R?
(a) {x | xᵢ is an integer for all i}.
(b) {x | xᵢ ≥ i for all i}.
(c) {x | xᵢ is an integer for all i ≥ 100}.
(d) {x | x₂ = x₃}.
Finite sets and infinite sets, countable sets and uncountable sets, these are types of sets
that you may have encountered before. Nevertheless, we shall discuss them in this 
section and the next, not only to make sure you understand them thoroughly, but also 
to elucidate some particular points of logic that will arise later on. First we consider 
finite sets. 

Recall that if n is a positive integer, we use Sₙ to denote the set of positive integers
less than n; it is called a section of the positive integers. The sets Sₙ are the prototypes
for what we call the finite sets.


Definition. A set A is said to be finite if there is a bijective correspondence of A with
some section of the positive integers. That is, A is finite if it is empty or if there is a
bijection


f : A → {1, ..., n}


for some positive integer n. In the former case, we say that A has cardinality 0; in the
latter case, we say that A has cardinality n.


For instance, the set {1,..., n} itself has cardinality n, for it is in bijective corre- 
spondence with itself under the identity function. 
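For a small concrete set, a bijection with a section of the positive integers can be produced simply by listing the elements. The Python sketch below is an informal illustration: the function names are ours, the bijection is stored as a dictionary, and Python's own iteration order is taken for granted.

```python
def counting_bijection(a):
    """Return a dict f : a -> {1, ..., n} obtained by enumerating the elements of a."""
    return {x: i for i, x in enumerate(a, start=1)}

def is_bijection_onto_section(f, a):
    """Check that f is defined exactly on a and takes each value 1, ..., n once."""
    n = len(f)
    return set(f) == set(a) and sorted(f.values()) == list(range(1, n + 1))

A = {"x", "y", "z"}
f = counting_bijection(A)
```

Listing the elements of A in a different order gives a different bijection f. That every such f ends at the same integer n is not part of the definition; it is a fact that requires proof.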
Now note carefully: We have not yet shown that the cardinality of a finite set is
uniquely determined by the set. It is of course clear that the empty set must have
cardinality zero. But as far as we know, there might exist bijective correspondences
of a given nonempty set A with two different sets {1, ..., n} and {1, ..., m}. The
possibility may seem ridiculous, for it is like saying that it is possible for two people
to count the marbles in a box and come out with two different answers, both correct.
Our experience with counting in everyday life suggests that such is impossible, and in
fact this is easy to prove when n is a small number such as 1, 2, or 3. But a direct proof
when n is 5 million would be impossibly demanding.

Even empirical demonstration would be difficult for such a large value of n. One
might, for instance, construct an experiment by taking a freight car full of marbles and
hiring 10 different people to count them independently. If one thinks of the physical
problems involved, it seems likely that the counters would not all arrive at the same
answer. Of course, the conclusion one could draw is that at least one person made a
mistake. But that would mean assuming the correctness of the result one was trying
to demonstrate empirically. An alternative explanation could be that there do exist
bijective correspondences between the given set of marbles and two different sections
of the positive integers.

In real life, we accept the first explanation. We simply take it on faith that our
experience in counting comparatively small sets of objects demonstrates a truth that
holds for arbitrarily large sets as well.

However, in mathematics (as opposed to real life), one does not have to take this
statement on faith. If it is formulated in terms of the existence of bijective
correspondences rather than in terms of the physical act of counting, it is capable of
mathematical proof. We shall prove shortly that if n ≠ m, there do not exist bijective functions
mapping a given set A onto both the sets {1, ..., n} and {1, ..., m}.

There are a number of other “intuitively obvious” facts about finite sets that are 
capable of mathematical proof; we shall prove some of them in this section and leave 
the rest to the exercises. Here is an easy fact to start with: 


Lemma 6.1. Let n be a positive integer. Let A be a set; let a₀ be an element of A.
Then there exists a bijective correspondence f of the set A with the set {1, ..., n + 1}
if and only if there exists a bijective correspondence g of the set A − {a₀} with the set
{1, ..., n}.


Proof. There are two implications to be proved. Let us first assume that there is a
bijective correspondence


g : A − {a₀} → {1, ..., n}.

We then define a function f : A → {1, ..., n + 1} by setting


f(x) = g(x) for x ∈ A − {a₀},
f(a₀) = n + 1.


One checks at once that f is bijective.
To prove the converse, assume there is a bijective correspondence

f : A → {1, ..., n + 1}.


If f maps a₀ to the number n + 1, things are especially easy; in that case, the
restriction f | A − {a₀} is the desired bijective correspondence of A − {a₀} with {1, ..., n}.
Otherwise, let f(a₀) = m, and let a₁ be the point of A such that f(a₁) = n + 1. Then
a₁ ≠ a₀. Define a new function


h : A → {1, ..., n + 1}

by setting


h(a₀) = n + 1,
h(a₁) = m,
h(x) = f(x) for x ∈ A − {a₀} − {a₁}.


See Figure 6.1. It is easy to check that h is a bijection.
Now we are back in the easy case; the restriction h | A − {a₀} is the desired bijection
of A − {a₀} with {1, ..., n}. ∎
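Both constructions in this proof are explicit enough to carry out by machine on a small set. The Python sketch below follows them step by step; it is an illustration under assumptions of ours (a bijection is stored as a dictionary, and the names `add_point` and `remove_point` are hypothetical).

```python
def add_point(g, a0):
    """Given a bijection g : A - {a0} -> {1, ..., n}, return f : A -> {1, ..., n + 1}."""
    if a0 in g:
        raise ValueError("a0 must not be in the domain of g")
    f = dict(g)
    f[a0] = len(g) + 1          # f(a0) = n + 1
    return f

def remove_point(f, a0):
    """Given a bijection f : A -> {1, ..., n + 1} and a0 in A,
    return a bijection A - {a0} -> {1, ..., n}."""
    top = len(f)                # this is n + 1
    h = dict(f)
    if f[a0] != top:
        m = f[a0]
        # a1 is the point of A with f(a1) = n + 1; swap the values at a0 and a1.
        a1 = next(a for a, v in f.items() if v == top)
        h[a0], h[a1] = top, m
    # Now h(a0) = n + 1, so restricting h to A - {a0} lands in {1, ..., n}.
    return {a: v for a, v in h.items() if a != a0}

f = {"a": 1, "b": 2, "c": 3, "d": 4}
g = remove_point(f, "b")        # the case f(a0) = m with m different from n + 1
```

In the example, f("b") = 2 and f("d") = 4, so h exchanges these two values before the restriction is taken, and g sends "d" to 2. Removing "d" instead falls under the easy case, where no exchange is needed.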



Figure 6.1 


From this lemma a number of useful consequences follow: 


Theorem 6.2. Let A be a set; suppose that there exists a bijection f : A → {1, ..., n}
for some n ∈ Z₊. Let B be a proper subset of A. Then there exists no bijection
g : B → {1, ..., n}; but (provided B ≠ ∅) there does exist a bijection h : B →
{1, ..., m} for some m < n.


Proof. The case in which B = ∅ is trivial, for there cannot exist a bijection of the
empty set B with the nonempty set {1, ..., n}.

We prove the theorem “by induction.” Let C be the subset of Z₊ consisting of
those integers n for which the theorem holds. We shall show that C is inductive. From
this we conclude that C = Z₊, so the theorem is true for all positive integers n.

First we show the theorem is true for n = 1. In this case A consists of a single
element {a}, and its only proper subset B is the empty set.
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Now assume that the theorem is true for n; we prove it true for n + 1. Suppose 
that f : A > {l,..., n + 1} is a bijection, and B is a nonempty proper subset of A. 
Choose an element ao of B and an element a; of A — B. We apply the preceding 
lemma to conclude there is a bijection 


g:A—{ag} — {l,..., n}. 


Now B — {ao} is a proper subset of A — {ao}, for a, belongs to A — {ag} and not to 
B —{ao}. Because the theorem has been assumed to hold for the integer n, we conclude 
the following: 

(1) There exists no bijection h : B — {ag} > {1,...,n}. 

(2) Either B — {ag} = Ø, or there exists a bijection 


k: B — {ao} — {1,..., p} forsome p <n. 


The preceding lemma, combined with (1), implies that there is no bijection of B with
{1, ..., n + 1}. This is the first half of what we wanted to prove. To prove the second
half, note that if B - {a_0} = Ø, there is a bijection of B with the set {1}; while if
B - {a_0} ≠ Ø, we can apply the preceding lemma, along with (2), to conclude that
there is a bijection of B with {1, ..., p + 1}. In either case, there is a bijection of B
with {1, ..., m} for some m < n + 1, as desired. The induction principle now shows
that the theorem is true for all n ∈ Z+. ∎


Corollary 6.3. If A is finite, there is no bijection of A with a proper subset of itself. 
Proof. Assume that B is a proper subset of A and that f : A → B is a bijection. By
assumption, there is a bijection g : A → {1, ..., n} for some n. The composite g ∘ f^{-1}
is then a bijection of B with {1, ..., n}. This contradicts the preceding theorem. ∎


Corollary 6.4. Z+ is not finite.

Proof. The function f : Z+ → Z+ - {1} defined by f(n) = n + 1 is a bijection
of Z+ with a proper subset of itself. ∎


Corollary 6.5. The cardinality of a finite set A is uniquely determined by A.

Proof. Let m < n. Suppose there are bijections

f : A → {1, ..., n},
g : A → {1, ..., m}.

Then the composite

g ∘ f^{-1} : {1, ..., n} → {1, ..., m}

is a bijection of the finite set {1, ..., n} with a proper subset of itself, contradicting the
corollary just proved. ∎


Corollary 6.6. If B is a subset of the finite set A, then B is finite. If B is a proper
subset of A, then the cardinality of B is less than the cardinality of A.


Corollary 6.7. Let B be a nonempty set. Then the following are equivalent: 
(1) B is finite. 
(2) There is a surjective function from a section of the positive integers onto B. 
(3) There is an injective function from B into a section of the positive integers. 


Proof. (1) ⇒ (2). Since B is nonempty, there is, for some n, a bijective function
f : {1, ..., n} → B.

(2) ⇒ (3). If f : {1, ..., n} → B is surjective, define g : B → {1, ..., n} by
the equation

g(b) = smallest element of f^{-1}({b}).

Because f is surjective, the set f^{-1}({b}) is nonempty; then the well-ordering property
of Z+ tells us that g(b) is uniquely defined. The map g is injective, for if b ≠ b',
then the sets f^{-1}({b}) and f^{-1}({b'}) are disjoint, so their smallest elements must be
different.

(3) ⇒ (1). If g : B → {1, ..., n} is injective, then changing the range of g gives
a bijection of B with a subset of {1, ..., n}. It follows from the preceding corollary
that B is finite. ∎
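
The step (2) ⇒ (3) is constructive, and it may help to see it carried out once. The Python sketch below is an illustration under our own conventions (a surjection is stored as a dictionary, and the name injection_from_surjection is hypothetical); it builds g(b) = smallest element of f^{-1}({b}).

```python
def injection_from_surjection(f, n):
    """f maps {1, ..., n} onto B (given as a dict); return g: B -> {1, ..., n}
    with g(b) = smallest element of the preimage of {b} under f."""
    g = {}
    for i in range(1, n + 1):
        # Visiting i in increasing order, the first i with f(i) = b is the smallest one.
        g.setdefault(f[i], i)
    return g


f = {1: "x", 2: "y", 3: "x", 4: "z", 5: "y"}   # a surjection onto B = {x, y, z}
g = injection_from_surjection(f, 5)
print(g)  # {'x': 1, 'y': 2, 'z': 4}
```

Distinct elements of B receive distinct values because their preimages are disjoint, which is exactly the argument given in the proof.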


Corollary 6.8. Finite unions and finite cartesian products of finite sets are finite. 


Proof. We first show that if A and B are finite, so is A ∪ B. The result is trivial
if A or B is empty. Otherwise, there are bijections f : {1, ..., m} → A and g :
{1, ..., n} → B for some choice of m and n. Define a function h : {1, ..., m +
n} → A ∪ B by setting h(i) = f(i) for i = 1, 2, ..., m and h(i) = g(i - m) for
i = m + 1, ..., m + n. It is easy to check that h is surjective, from which it follows
(by Corollary 6.7) that A ∪ B is finite.

Now we show by induction that finiteness of the sets A_1, ..., A_n implies finiteness
of their union. This result is trivial for n = 1. Assuming it true for n - 1, we note that
A_1 ∪ ... ∪ A_n is the union of the two finite sets A_1 ∪ ... ∪ A_{n-1} and A_n, so the result
of the preceding paragraph applies.

Now we show that the cartesian product of two finite sets A and B is finite. Given
a ∈ A, the set {a} × B is finite, being in bijective correspondence with B. The set
A × B is the union of these sets; since there are only finitely many of them, A × B is
a finite union of finite sets and thus finite.

To prove that the product A_1 × ... × A_n is finite if each A_i is finite, one proceeds
by induction. ∎


Exercises


1. (a) Make a list of all the injective maps

           f : {1, 2, 3} → {1, 2, 3, 4}.

       Show that none is bijective. (This constitutes a direct proof that a set A of
       cardinality three does not have cardinality four.)
   (b) How many injective maps

           f : {1, ..., 8} → {1, ..., 10}

       are there? (You can see why one would not wish to try to prove directly that
       there is no bijective correspondence between these sets.)
2. Show that if B is not finite and B ⊂ A, then A is not finite.
3. Let X be the two-element set {0, 1}. Find a bijective correspondence between
   X^ω and a proper subset of itself.
4. Let A be a nonempty finite simply ordered set.
   (a) Show that A has a largest element. [Hint: Proceed by induction on the
       cardinality of A.]
   (b) Show that A has the order type of a section of the positive integers.
5. If A × B is finite, does it follow that A and B are finite?
6. (a) Let A = {1, ..., n}. Show there is a bijection of P(A) with the cartesian
       product X^n, where X is the two-element set X = {0, 1}.
   (b) Show that if A is finite, then P(A) is finite.
7. If A and B are finite, show that the set of all functions f : A → B is finite.


§7 Countable and Uncountable Sets


Just as sections of the positive integers are the prototypes for the finite sets, the set of
all the positive integers is the prototype for what we call the countably infinite sets. In
this section, we shall study such sets; we shall also construct some sets that are neither
finite nor countably infinite. This study will lead us into a discussion of what we mean
by the process of “inductive definition.”


Definition. A set A is said to be infinite if it is not finite. It is said to be countably 
infinite if there is a bijective correspondence 


f : A → Z+.


EXAMPLE 1. The set Z of all integers is countably infinite. One checks easily that the
function f : Z → Z+ defined by

f(n) = 2n        if n > 0,
f(n) = -2n + 1   if n ≤ 0

is a bijection.

EXAMPLE 2. The product Z+ × Z+ is countably infinite. If we represent the elements of
the product Z+ × Z+ by the integer points in the first quadrant, then the left-hand portion
of Figure 7.1 suggests how to “count” the points, that is, how to put them in bijective
correspondence with the positive integers. A picture is not a proof, of course, but this
picture suggests a proof. First, we define a bijection f : Z+ × Z+ → A, where A is the
subset of Z+ × Z+ consisting of pairs (x, y) for which y ≤ x, by the equation

f(x, y) = (x + y - 1, y).

Then we construct a bijection of A with the positive integers, defining g : A → Z+ by the
formula

g(x, y) = (1/2)(x - 1)x + y.

We leave it to you to show that f and g are bijections.
Another proof that Z+ × Z+ is countably infinite will be given later.
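
Before attempting the proofs, one can gain some confidence in the formulas of Examples 1 and 2 by testing them on finite ranges. The Python sketch below is a numerical check only, not a proof; the names f1, f2, and g2 are ours, standing for the map f(n) = 2n (n > 0), -2n + 1 (n ≤ 0) of Example 1 and the maps f(x, y) = (x + y - 1, y) and g(x, y) = (1/2)(x - 1)x + y of Example 2.

```python
def f1(n):
    """Example 1: the map Z -> Z+."""
    return 2 * n if n > 0 else -2 * n + 1


def f2(x, y):
    """Example 2: Z+ x Z+ -> A, where A consists of the pairs (x, y) with y <= x."""
    return (x + y - 1, y)


def g2(x, y):
    """Example 2: A -> Z+. The product (x - 1) * x is even, so // is exact."""
    return (x - 1) * x // 2 + y


# Example 1: the integers -N, ..., N are carried exactly onto {1, ..., 2N + 1}.
N = 50
assert sorted(f1(n) for n in range(-N, N + 1)) == list(range(1, 2 * N + 2))

# Example 2: the pairs (x, y) with x + y - 1 <= K are carried by the composite
# of f2 followed by g2 exactly onto {1, ..., K(K + 1)/2}, each value occurring once.
K = 30
values = [g2(*f2(x, y)) for x in range(1, K + 1) for y in range(1, K + 2 - x)]
assert sorted(values) == list(range(1, K * (K + 1) // 2 + 1))
```

The second check counts the points of the first quadrant one diagonal at a time, which is the pattern the left-hand portion of Figure 7.1 suggests.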


Figure 7.1 


Definition. A set is said to be countable if it is either finite or countably infinite. A 
set that is not countable is said to be uncountable. 


There is a very useful criterion for showing that a set is countable. It is the
following:


Theorem 7.1. Let B be a nonempty set. Then the following are equivalent:
(1) B is countable.
(2) There is a surjective function f : Z+ → B.
(3) There is an injective function g : B → Z+.


Proof. (1) ⇒ (2). Suppose that B is countable. If B is countably infinite, there is
a bijection f : Z+ → B by definition, and we are through. If B is finite, there is a
bijection h : {1, ..., n} → B for some n ≥ 1. (Recall that B ≠ Ø.) We can extend h
to a surjection f : Z+ → B by defining

f(i) = h(i)   for 1 ≤ i ≤ n,
f(i) = h(1)   for i > n.

(2) ⇒ (3). Let f : Z+ → B be a surjection. Define g : B → Z+ by the equation

g(b) = smallest element of f^{-1}({b}).

Because f is surjective, f^{-1}({b}) is nonempty; thus g is well defined. The map g is
injective, for if b ≠ b', the sets f^{-1}({b}) and f^{-1}({b'}) are disjoint, so their smallest
elements are different.

(3) ⇒ (1). Let g : B → Z+ be an injection; we wish to prove B is countable.
By changing the range of g, we can obtain a bijection of B with a subset of Z+. Thus
to prove our result, it suffices to show that every subset of Z+ is countable. So let C
be a subset of Z+.

If C is finite, it is countable by definition. So what we need to prove is that every
infinite subset C of Z+ is countably infinite. This statement is certainly plausible. For
the elements of C can easily be arranged in an infinite sequence; one simply takes the
set Z+ in its usual order and “erases” all the elements of Z+ that are not in C!

The plausibility of this argument may make one overlook its informality. Providing
a formal proof requires a certain amount of care. We state this result as a separate
lemma, which follows. ∎


Lemma 7.2. If C is an infinite subset of Z+, then C is countably infinite.


Proof. We define a bijection h : Z+ → C. We proceed by induction. Define h(1) to
be the smallest element of C; it exists because every nonempty subset C of Z+ has a
smallest element. Then assuming that h(1), ..., h(n - 1) are defined, define

h(n) = smallest element of [C - h({1, ..., n - 1})].

The set C - h({1, ..., n - 1}) is not empty; for if it were empty, then h : {1, ..., n -
1} → C would be surjective, so that C would be finite (by Corollary 6.7). Thus h(n)
is well defined. By induction, we have defined h(n) for all n ∈ Z+.

To show that h is injective is easy. Given m < n, note that h(m) belongs to the set
h({1, ..., n - 1}), whereas h(n), by definition, does not. Hence h(n) ≠ h(m).

To show that h is surjective, let c be any element of C; we show that c lies in the
image set of h. First note that h(Z+) cannot be contained in the finite set {1, ..., c},
because h(Z+) is infinite (since h is injective). Therefore, there is an n in Z+ such
that h(n) > c. Let m be the smallest element of Z+ such that h(m) ≥ c. Then for all
i < m, we must have h(i) < c. Thus, c does not belong to the set h({1, ..., m - 1}).
Since h(m) is defined as the smallest element of the set C - h({1, ..., m - 1}), we
must have h(m) ≤ c. Putting the two inequalities together, we have h(m) = c, as
desired. ∎

There is a point in the preceding proof where we stretched the principles of logic
a bit. It occurred at the point where we said that “using the induction principle” we
had defined the function h for all positive integers n. You may have seen arguments
like this used before, with no questions raised concerning their legitimacy. We have
already used such an argument ourselves, in the exercises of §4, when we defined a^n.

But there is a problem here. After all, the induction principle states only that if A
is an inductive set of positive integers, then A = Z+. To use the principle to prove a
theorem “by induction,” one begins the proof with the statement “Let A be the set of
all positive integers n for which the theorem is true,” and then one goes ahead to prove
that A is inductive, so that A must be all of Z+.

In the preceding theorem, however, we were not really proving a theorem by in- 
duction, but defining something by induction. How then should we start the proof? 
Can we start by saying, “Let A be the set of all integers n for which the function h is 
defined”? But that's silly; the symbol h has no meaning at the outset of the proof. It 
only takes on meaning in the course of the proof. So something more is needed. 

What is needed is another principle, which we call the principle of recursive defi- 
nition. In the proof of the preceding theorem, we wished to assert the following: 

Given the infinite subset C of Z+, there is a unique function h : Z+ → C
satisfying the formula:

(*)   h(1) = smallest element of C,
      h(i) = smallest element of [C - h({1, ..., i - 1})]   for all i > 1.

The formula (*) is called a recursion formula for h; it defines the function h in
terms of itself. A definition given by such a formula is called a recursive definition.

Now one can get into logical difficulties when one tries to define something recursively.
Not all recursive formulas make sense. The recursive formula

h(i) = smallest element of [C - h({1, ..., i + 1})],

for example, is self-contradictory; although h(i) necessarily is an element of the set
h({1, ..., i + 1}), this formula says that it does not belong to the set. Another example
is the classic paradox:


Let the barber of Seville shave every man of Seville who does not shave himself. 
Who shall shave the barber? 


In this statement, the barber appears twice, once in the phrase “the barber of Seville” 
and once as an element of the set “men of Seville”; this definition of whom the barber 
shall shave is a recursive one. It also happens to be self-contradictory. 

Some recursive formulas do make sense, however. Specifically, one has the fol- 
lowing principle: 


Principle of recursive definition. Let A be a set. Given a formula that defines h(1)
as a unique element of A, and for i > 1 defines h(i) uniquely as an element of A
in terms of the values of h for positive integers less than i, this formula determines a
unique function h : Z+ → A.

This principle is the one we actually used in the proof of Lemma 7.2. You can
simply accept it on faith if you like. It may however be proved rigorously, using the
principle of induction. We shall formulate it more precisely in the next section and
indicate how it is proved. Mathematicians seldom refer to this principle specifically.
They are much more likely to write a proof like our proof of Lemma 7.2 above, a proof
in which they invoke the “induction principle” to define a function when what they are
really using is the principle of recursive definition. We shall avoid undue pedantry in
this book by following their example.
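
Anyone who has written a loop that builds a list from its earlier entries has used a recursive definition in just this sense. As an illustration only, the Python sketch below computes the function h of Lemma 7.2, taking h(i) to be the smallest element of C not among h(1), ..., h(i - 1). The name enumerate_subset and the description of C by a membership test are assumptions made here, and C must be infinite or the search does not terminate.

```python
from itertools import count, islice
from math import isqrt


def enumerate_subset(in_C):
    """Yield h(1), h(2), ..., where h(i) is the smallest element of C that is not
    among h(1), ..., h(i-1). C is given by a membership test on positive integers."""
    previous = set()                      # the image set h({1, ..., i-1})
    while True:
        # Well-ordering: scan 1, 2, 3, ... for the smallest element of C - previous.
        value = next(c for c in count(1) if in_C(c) and c not in previous)
        previous.add(value)
        yield value


def is_square(c):
    return isqrt(c) ** 2 == c


print(list(islice(enumerate_subset(is_square), 6)))  # [1, 4, 9, 16, 25, 36]
```

Each value depends only on the values already produced, which is the feature that the principle of recursive definition requires of a recursion formula.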


Corollary 7.3. A subset of a countable set is countable. 


Proof. Suppose A ⊂ B, where B is countable. There is an injection f of B into Z+;
the restriction of f to A is an injection of A into Z+. ∎


Corollary 7.4. The set Z+ × Z+ is countably infinite.


Proof. In view of Theorem 7.1, it suffices to construct an injective map f : Z+ ×
Z+ → Z+. We define f by the equation

f(n, m) = 2^n 3^m.

It is easy to check that f is injective. For suppose that 2^n 3^m = 2^p 3^q. If n < p, then
3^m = 2^{p-n} 3^q, contradicting the fact that 3^m is odd for all m. Therefore, n = p. As
a result, 3^m = 3^q. Then if m < q, it follows that 1 = 3^{q-m}, another contradiction.
Hence m = q. ∎
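
The injectivity of f(n, m) = 2^n 3^m amounts to saying that the pair (n, m) can be read back from the value of f. The Python sketch below illustrates this with a hypothetical helper decode, which counts the factors 2 and 3; the check at the end covers a finite range only and is no substitute for the argument above.

```python
def f(n, m):
    return 2 ** n * 3 ** m


def decode(v):
    """Recover (n, m) from v = 2^n 3^m by counting the factors 2 and 3.
    Return None if v has a prime factor other than 2 and 3."""
    n = 0
    while v % 2 == 0:
        v //= 2
        n += 1
    m = 0
    while v % 3 == 0:
        v //= 3
        m += 1
    return (n, m) if v == 1 else None


# Every pair in a finite range is recovered from its value, so no two pairs collide.
N = 40
assert all(decode(f(n, m)) == (n, m)
           for n in range(1, N + 1) for m in range(1, N + 1))
```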


EXAMPLE 3. The set Q+ of positive rational numbers is countably infinite. For we can
define a surjection g : Z+ × Z+ → Q+ by the equation

g(n, m) = m/n.

Because Z+ × Z+ is countable, there is a surjection f : Z+ → Z+ × Z+. Then the
composite g ∘ f : Z+ → Q+ is a surjection, so that Q+ is countable. And, of course, Q+
is infinite because it contains Z+.

We leave it as an exercise to show the set Q of all rational numbers is countably infinite.
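
To make the composite g ∘ f of Example 3 concrete, one needs a particular surjection of Z+ onto Z+ × Z+. The Python sketch below uses one convenient choice, which is our assumption and not the only possibility: an integer of the form 2^n 3^m with n, m ≥ 1 is sent to (n, m), and every other integer to (1, 1). The names pair_at and rational_at are hypothetical.

```python
from fractions import Fraction


def pair_at(k):
    """A surjection Z+ -> Z+ x Z+: k = 2^n 3^m with n, m >= 1 goes to (n, m),
    and every other k goes to (1, 1)."""
    n = m = 0
    v = k
    while v % 2 == 0:
        v //= 2
        n += 1
    while v % 3 == 0:
        v //= 3
        m += 1
    return (n, m) if v == 1 and n >= 1 and m >= 1 else (1, 1)


def g(n, m):
    """The surjection of Example 3: g(n, m) = m/n."""
    return Fraction(m, n)


def rational_at(k):
    """The composite of pair_at followed by g, a surjection Z+ -> Q+."""
    return g(*pair_at(k))


# The rational 3/4 = g(4, 3) occurs at the index 2^4 * 3^3 = 432.
assert rational_at(2 ** 4 * 3 ** 3) == Fraction(3, 4)
```

The resulting list of rationals repeats values many times; surjectivity is all that Theorem 7.1 asks for.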


Theorem 7.5. A countable union of countable sets is countable. 


Proof. Let {A_n}_{n∈J} be an indexed family of countable sets, where the index set J is
either {1, ..., N} or Z+. Assume that each set A_n is nonempty, for convenience; this
assumption does not change anything.

Because each A_n is countable, we can choose, for each n, a surjective function
f_n : Z+ → A_n. Similarly, we can choose a surjective function g : Z+ → J. Now
define

h : Z+ × Z+ → ∪_{n∈J} A_n

by the equation

h(k, m) = f_{g(k)}(m).

It is easy to check that h is surjective. Since Z+ × Z+ is in bijective correspondence
with Z+, the countability of the union follows from Theorem 7.1. ∎
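
The map h(k, m) = f_{g(k)}(m) can be tried out on a simple family. In the Python sketch below, which is an illustration under assumptions of our own choosing, J = Z+, the set A_n consists of the strings of length n in the letters a and b, the surjection f_n runs through A_n cyclically, and g is the identity.

```python
from itertools import product


def f(n, m):
    """The value f_n(m) of a surjection f_n: Z+ -> A_n, where A_n is the set of
    length-n strings over {a, b}. A_n is finite, so the values repeat with period 2^n."""
    words = ["".join(w) for w in product("ab", repeat=n)]
    return words[(m - 1) % len(words)]


def g(k):
    """A surjection Z+ -> J; here J = Z+ and g is the identity."""
    return k


def h(k, m):
    return f(g(k), m)


# Every string of length at most 3 already occurs among the values h(k, m)
# with k <= 3 and m <= 8.
seen = {h(k, m) for k in range(1, 4) for m in range(1, 9)}
assert seen == {"".join(w) for n in range(1, 4) for w in product("ab", repeat=n)}
```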


Theorem 7.6. A finite product of countable sets is countable. 


Proof. First let us show that the product of two countable sets A and B is countable.
The result is trivial if A or B is empty. Otherwise, choose surjective functions f :
Z+ → A and g : Z+ → B. Then the function h : Z+ × Z+ → A × B defined by the
equation h(n, m) = (f(n), g(m)) is surjective, so that A × B is countable.

In general, we proceed by induction. Assuming that A_1 × ... × A_{n-1} is countable
if each A_i is countable, we prove the same thing for the product A_1 × ... × A_n. First,
note that there is a bijective correspondence

g : A_1 × ... × A_n → (A_1 × ... × A_{n-1}) × A_n

defined by the equation

g(x_1, ..., x_n) = ((x_1, ..., x_{n-1}), x_n).

Because the set A_1 × ... × A_{n-1} is countable by the induction assumption and A_n is
countable by hypothesis, the product of these two sets is countable, as proved in the
preceding paragraph. We conclude that A_1 × ... × A_n is countable as well. ∎


It is very tempting to assert that countable products of countable sets should be 
countable; but this assertion is in fact not true: 


Theorem 7.7. Let X denote the two element set {0, 1}. Then the set X^ω is uncountable.


Proof. We show that, given any function

g : Z+ → X^ω,

g is not surjective. For this purpose, let us denote g(n) as follows:

g(n) = (x_n1, x_n2, x_n3, ..., x_nm, ...),

where each x_ij is either 0 or 1. Then we define an element y = (y_1, y_2, ..., y_n, ...)
of X^ω by letting

y_n = 0   if x_nn = 1,
y_n = 1   if x_nn = 0.

(If we write the numbers x_ni in a rectangular array, the particular elements x_nn appear
as the diagonal entries in this array; we choose y so that its nth coordinate differs from
the diagonal entry x_nn.)

Now y is an element of X^ω, and y does not lie in the image of g; given n, the
point g(n) and the point y differ in at least one coordinate, namely, the nth. Thus, g is
not surjective. ∎
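
The diagonal procedure is easily expressed as a program. In the Python sketch below, an element of X^ω is modeled as a function from positive integers to {0, 1}; this modeling, the name diagonal, and the sample map g are all assumptions made for illustration. Whatever g is supplied, the sequence y_n = 1 - x_nn differs from g(n) in the nth coordinate.

```python
def diagonal(g):
    """Given g: Z+ -> X^omega, where each g(n) is itself a function Z+ -> {0, 1},
    return the sequence y with y_n = 1 - x_nn."""
    return lambda n: 1 - g(n)(n)


def g(n):
    """A sample attempted enumeration: g(n) is the sequence whose m-th term is the
    m-th binary digit of n, counting from the least significant digit."""
    return lambda m: (n >> (m - 1)) & 1


y = diagonal(g)
# y differs from g(n) in coordinate n for every n we care to test.
assert all(y(n) != g(n)(n) for n in range(1, 1000))
```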


The cartesian product {0, 1}^ω is one example of an uncountable set. Another is the
set P(Z+), as the following theorem implies:


Theorem 7.8. Let A be a set. There is no injective map f : P(A) → A, and there is
no surjective map g : A → P(A).


Proof. In general, if B is a nonempty set, the existence of an injective map f : B →
C implies the existence of a surjective map g : C → B; one defines g(c) = f^{-1}(c)
for each c in the image set of f, and defines g arbitrarily on the rest of C.

Therefore, it suffices to prove that given a map g : A → P(A), the map g is not
surjective. For each a ∈ A, the image g(a) of a is a subset of A, which may or may
not contain the point a itself. Let B be the subset of A consisting of all those points a
such that g(a) does not contain a;

B = {a | a ∈ A - g(a)}.

Now, B may be empty, or it may be all of A, but that does not matter. We assert that B
is a subset of A that does not lie in the image of g. For suppose that B = g(a_0) for
some a_0 ∈ A. We ask the question: Does a_0 belong to B or not? By definition of B,

a_0 ∈ B ⟺ a_0 ∈ A - g(a_0) ⟺ a_0 ∈ A - B.

In either case, we have a contradiction. ∎
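
For a small finite set A, the assertion that B never lies in the image of g can be confirmed by trying every map g : A → P(A). The Python sketch below does this for a three-element set; it is an exhaustive check of one case and not a proof, and the names powerset and missed_subset are ours.

```python
from itertools import combinations, product


def powerset(A):
    return [frozenset(c) for r in range(len(A) + 1) for c in combinations(A, r)]


def missed_subset(A, g):
    """The set B = {a in A : a is not in g(a)} from the proof; g is a dict A -> P(A)."""
    return frozenset(a for a in A if a not in g[a])


A = [1, 2, 3]
subsets = powerset(A)
# Try each of the 8^3 = 512 maps g: A -> P(A) and confirm that B is never a value of g.
for choice in product(subsets, repeat=len(A)):
    g = dict(zip(A, choice))
    assert missed_subset(A, g) not in g.values()
```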


Now we have proved the existence of uncountable sets. But we have not yet men- 
tioned the most familiar uncountable set of all—the set of real numbers. You have 
probably seen the uncountability of R demonstrated already. If one assumes that every 
real number can be represented uniquely by an infinite decimal (with the proviso that a 
representation ending in an infinite string of 9’s is forbidden), then the uncountability 
of the reals can be proved by a variant of the diagonal procedure used in the proof of 
Theorem 7.7. But this proof is in some ways not very satisfying. One reason is that 
the infinite decimal representation of a real number is not at all an elementary conse- 
quence of the axioms but requires a good deal of labor to prove. Another reason is 
that the uncountability of R does not, in fact, depend on the infinite decimal expansion 
of R or indeed on any of the algebraic properties of R; it depends on only the order 
properties of R. We shall demonstrate the uncountability of R, using only its order 
properties, in a later chapter.


Exercises


1. Show that Q is countably infinite.
2. Show that the maps f and g of Examples 1 and 2 are bijections.
3. Let X be the two-element set {0, 1}. Show there is a bijective correspondence
   between the set P(Z+) and the cartesian product X^ω.
4. (a) A real number x is said to be algebraic (over the rationals) if it satisfies some
       polynomial equation of positive degree

           x^n + a_{n-1} x^{n-1} + ... + a_1 x + a_0 = 0

       with rational coefficients a_i. Assuming that each polynomial equation has
       only finitely many roots, show that the set of algebraic numbers is countable.
   (b) A real number is said to be transcendental if it is not algebraic. Assuming
       the reals are uncountable, show that the transcendental numbers are uncountable.
       (It is a somewhat surprising fact that only two transcendental numbers
       are familiar to us: e and π. Even proving these two numbers transcendental
       is highly nontrivial.)
5. Determine, for each of the following sets, whether or not it is countable. Justify
   your answers.
   (a) The set A of all functions f : {0, 1} → Z+.
   (b) The set B_n of all functions f : {1, ..., n} → Z+.
   (c) The set C = ∪_{n∈Z+} B_n.
   (d) The set D of all functions f : Z+ → Z+.
   (e) The set E of all functions f : Z+ → {0, 1}.
   (f) The set F of all functions f : Z+ → {0, 1} that are “eventually zero.”
       [We say that f is eventually zero if there is a positive integer N such that
       f(n) = 0 for all n ≥ N.]
   (g) The set G of all functions f : Z+ → Z+ that are eventually 1.
   (h) The set H of all functions f : Z+ → Z+ that are eventually constant.
   (i) The set I of all two-element subsets of Z+.
   (j) The set J of all finite subsets of Z+.
6. We say that two sets A and B have the same cardinality if there is a bijection
   of A with B.
   (a) Show that if B ⊂ A and if there is an injection

           f : A → B,

       then A and B have the same cardinality. [Hint: Define A_1 = A, B_1 = B,
       and for n > 1, A_n = f(A_{n-1}) and B_n = f(B_{n-1}). (Recursive definition
       again!) Note that A_1 ⊃ B_1 ⊃ A_2 ⊃ B_2 ⊃ A_3 ⊃ ···. Define a bijection
       h : A → B by the rule

           h(x) = f(x)   if x ∈ A_n - B_n for some n,
           h(x) = x      otherwise.]
   (b) Theorem (Schroeder-Bernstein theorem). If there are injections f : A →
       C and g : C → A, then A and C have the same cardinality.
7. Show that the sets D and E of Exercise 5 have the same cardinality.
8. Let X denote the two-element set {0, 1}; let B be the set of countable subsets of
   X^ω. Show that X^ω and B have the same cardinality.
9. (a) The formula

       (*)   h(1) = 1,
             h(2) = 2,
             h(n) = [h(n + 1)]^2 - [h(n - 1)]^2   for n ≥ 2

       is not one to which the principle of recursive definition applies. Show that
       nevertheless there does exist a function h : Z+ → R satisfying this formula.
       [Hint: Reformulate (*) so that the principle will apply and require h to be
       positive.]
   (b) Show that the formula (*) of part (a) does not determine h uniquely. [Hint:
       If h is a positive function satisfying (*), let f(i) = h(i) for i ≠ 3, and let
       f(3) = -h(3).]
   (c) Show that there is no function h : Z+ → R satisfying the formula

             h(1) = 1,
             h(2) = 2,
             h(n) = [h(n + 1)]^2 + [h(n - 1)]^2   for n ≥ 2.


*§8 The Principle of Recursive Definition


Before considering the general form of the principle of recursive definition, let us first 
prove it in a specific case, that of Lemma 7.2. That should make the underlying idea 
of the proof much clearer when we consider the general case. 

So, given the infinite subset C of Z+, let us consider the following recursion formula
for a function h : Z+ → C:

(*)   h(1) = smallest element of C,
      h(i) = smallest element of [C - h({1, ..., i - 1})]   for i > 1.

We shall prove that there exists a unique function h : Z+ → C satisfying this recursion
formula.

The first step is to prove that there exist functions defined on sections {1, ..., n}
of Z+ that satisfy (*):


Lemma 8.1. Given n ∈ Z+, there exists a function

f : {1, ..., n} → C

that satisfies (*) for all i in its domain.

Proof. The point of this lemma is that it is a statement that depends on n; therefore, it
is capable of being proved by induction. Let A be the set of all n for which the lemma
holds. We show that A is inductive. It then follows that A = Z+.

The lemma is true for n = 1, since the function f : {1} → C defined by the
equation

f(1) = smallest element of C

satisfies (*).

Supposing the lemma to be true for n - 1, we prove it true for n. By hypothesis,
there is a function f' : {1, ..., n - 1} → C satisfying (*) for all i in its domain.
Define f : {1, ..., n} → C by the equations

f(i) = f'(i)   for i ∈ {1, ..., n - 1},
f(n) = smallest element of [C - f'({1, ..., n - 1})].

Since C is infinite, f' is not surjective; hence the set C - f'({1, ..., n - 1}) is not
empty, and f(n) is well defined. Note that this definition is an acceptable one; it does
not define f in terms of itself but in terms of the given function f'.

It is easy to check that f satisfies (*) for all i in its domain. The function f
satisfies (*) for i ≤ n - 1 because it equals f' there. And f satisfies (*) for i = n
because, by definition,

f(n) = smallest element of [C - f'({1, ..., n - 1})]

and f'({1, ..., n - 1}) = f({1, ..., n - 1}). ∎


Lemma 8.2. Suppose that f : {1, ..., n} → C and g : {1, ..., m} → C both
satisfy (*) for all i in their respective domains. Then f(i) = g(i) for all i in both
domains.


Proof. Suppose not. Let i be the smallest integer for which f(i) ≠ g(i). The integer i
is not 1, because

f(1) = smallest element of C = g(1),

by (*). Now for all j < i, we have f(j) = g(j). Because f and g satisfy (*),

f(i) = smallest element of [C - f({1, ..., i - 1})],
g(i) = smallest element of [C - g({1, ..., i - 1})].

Since f({1, ..., i - 1}) = g({1, ..., i - 1}), we have f(i) = g(i), contrary to the
choice of i. ∎


Theorem 8.3. There exists a unique function h : Z+ → C satisfying (*) for all
i ∈ Z+.

Proof. By Lemma 8.1, there exists for each n a function that maps {1, ..., n} into C
and satisfies (*) for all i in its domain. Given n, Lemma 8.2 shows that this function
is unique; two such functions having the same domain must be equal. Let f_n :
{1, ..., n} → C denote this unique function.

Now comes the crucial step. We define a function h : Z+ → C by defining its
rule to be the union U of the rules of the functions f_n. The rule for f_n is a subset of
{1, ..., n} × C; therefore, U is a subset of Z+ × C. We must show that U is the rule
for a function h : Z+ → C.

That is, we must show that each element i of Z+ appears as the first coordinate of
exactly one element of U. This is easy. The integer i lies in the domain of f_n if and
only if n ≥ i. Therefore, the set of elements of U of which i is the first coordinate is
precisely the set of all pairs of the form (i, f_n(i)), for n ≥ i. Now Lemma 8.2 tells us
that f_n(i) = f_m(i) if n, m ≥ i. Therefore, all these elements of U are equal; that is,
there is only one element of U that has i as its first coordinate.

To show that h satisfies (*) is also easy; it is a consequence of the following facts:

h(i) = f_n(i)   for i ≤ n,
f_n satisfies (*) for all i in its domain.

The proof of uniqueness is a copy of the proof of Lemma 8.2. ∎


Now we formulate the general principle of recursive definition. There are no new
ideas involved in its proof, so we leave it as an exercise.


Theorem 8.4 (Principle of recursive definition). Let A be a set; let a_0 be an element
of A. Suppose p is a function that assigns, to each function f mapping a
nonempty section of the positive integers into A, an element of A. Then there exists a
unique function

h : Z+ → A

such that

(*)   h(1) = a_0,
      h(i) = p(h | {1, ..., i - 1})   for i > 1.

The formula (*) is called a recursion formula for h. It specifies h(1), and it
expresses the value of h at i > 1 in terms of the values of h for positive integers less
than i.


EXAMPLE 1. Let us show that Theorem 8.3 is a special case of this theorem. Given the
infinite subset C of Z+, let a_0 be the smallest element of C, and define p by the equation

p(f) = smallest element of [C - (image set of f)].

Because C is infinite and f is a function mapping a finite set into C, the image set of f is
not all of C; therefore, p is well defined. By Theorem 8.4 there exists a function h : Z+ →
C such that h(1) = a_0, and for i > 1,

h(i) = p(h | {1, ..., i - 1})
     = smallest element of [C - (image set of h | {1, ..., i - 1})]
     = smallest element of [C - h({1, ..., i - 1})],

as desired.


EXAMPLE 2. Given a ∈ R, we “defined” a^n, in the exercises of §4, by the recursion
formula

a^1 = a,
a^n = a^{n-1} · a.

We wish to apply Theorem 8.4 to define a function h : Z+ → R rigorously such that
h(n) = a^n. To apply this theorem, let a_0 denote the element a of R, and define p by the
equation p(f) = f(m) · a, where f : {1, ..., m} → R. Then there exists a unique function
h : Z+ → R such that

h(1) = a_0,
h(i) = p(h | {1, ..., i - 1})   for i > 1.

This means that h(1) = a, and h(i) = h(i - 1) · a for i > 1. If we denote h(i) by a^i, we
have

a^1 = a,
a^i = a^{i-1} · a,

as desired.
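
The form of Theorem 8.4 translates almost word for word into a program: one supplies a_0 and the function p, and obtains h. The Python sketch below is an illustration, not a proof of the theorem; the name define_recursively is hypothetical, and the restriction h | {1, ..., i - 1} is passed to p as the list [h(1), ..., h(i - 1)], a representation we have chosen for convenience.

```python
def define_recursively(a0, p):
    """Return h with h(1) = a0 and h(i) = p(h | {1, ..., i-1}) for i > 1.
    The restriction h | {1, ..., i-1} is passed to p as the list [h(1), ..., h(i-1)]."""
    values = [a0]                      # values[j] holds h(j + 1)

    def h(i):
        while len(values) < i:
            values.append(p(list(values)))
        return values[i - 1]

    return h


# Example 2 with a = 3: p(f) = f(m) * a, where m is the last point of the domain of f.
a = 3
power = define_recursively(a, lambda f: f[-1] * a)
assert [power(i) for i in range(1, 6)] == [3, 9, 27, 81, 243]


# Example 1 with C = the even positive integers:
# p(f) = smallest element of C - (image set of f).
def smallest_unused_even(f):
    # Among the len(f) + 1 even numbers 2, 4, ..., 2 * len(f) + 2, one is not a value of f.
    return next(c for c in range(2, 2 * len(f) + 3, 2) if c not in f)


h = define_recursively(2, smallest_unused_even)
assert [h(i) for i in range(1, 5)] == [2, 4, 6, 8]
```

In the first case p looks only at the last value of f, and in the second it looks at the whole image set; the theorem allows p to depend on f in any way whatever.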


Exercises 


1. Let (b_1, b_2, ...) be an infinite sequence of real numbers. The sum Σ_{k=1}^{n} b_k is
   defined by induction as follows:

       Σ_{k=1}^{n} b_k = b_1                         for n = 1,
       Σ_{k=1}^{n} b_k = (Σ_{k=1}^{n-1} b_k) + b_n   for n > 1.

   Let A be the set of real numbers; choose p so that Theorem 8.4 applies to define
   this sum rigorously. We sometimes denote the sum Σ_{k=1}^{n} b_k by the symbol
   b_1 + b_2 + ... + b_n.
2. Let (b_1, b_2, ...) be an infinite sequence of real numbers. We define the product
   Π_{k=1}^{n} b_k by the equations

       Π_{k=1}^{1} b_k = b_1,
       Π_{k=1}^{n} b_k = (Π_{k=1}^{n-1} b_k) · b_n   for n > 1.

   Use Theorem 8.4 to define this product rigorously. We sometimes denote the
   product Π_{k=1}^{n} b_k by the symbol b_1 b_2 ··· b_n.
3. Obtain the definitions of a^n and n! for n ∈ Z+ as special cases of Exercise 2.
4. The Fibonacci numbers of number theory are defined recursively by the formula

       λ_1 = λ_2 = 1,
       λ_n = λ_{n-1} + λ_{n-2}   for n > 2.

   Define them rigorously by use of Theorem 8.4.
5. Show that there is a unique function h : Z+ → R+ satisfying the formula

       h(1) = 3,
       h(i) = [h(i - 1) + 1]^{1/2}   for i > 1.

6. (a) Show that there is no function h : Z+ → R+ satisfying the formula

           h(1) = 3,
           h(i) = [h(i - 1) - 1]^{1/2}   for i > 1.

       Explain why this example does not violate the principle of recursive definition.
   (b) Consider the recursion formula

           h(1) = 3,
           h(i) = [h(i - 1) - 1]^{1/2}   if h(i - 1) > 1,
           h(i) = 5                      if h(i - 1) ≤ 1,      for i > 1.

       Show that there exists a unique function h : Z+ → R+ satisfying this formula.
7. Prove Theorem 8.4.
8. Verify the following version of the principle of recursive definition: Let A be
   a set. Let p be a function assigning, to every function f mapping a section S_n
   of Z+ into A, an element p(f) of A. Then there is a unique function h : Z+ → A
   such that h(n) = p(h | S_n) for each n ∈ Z+.


§9 Infinite Sets and the Axiom of Choice


We have already obtained several criteria for a set to be infinite. We know, for instance,
that a set A is infinite if it has a countably infinite subset, or if there is a bijection of A
with a proper subset of itself. It turns out that either of these properties is sufficient
to characterize infinite sets. This we shall now prove. The proof will lead us into a
discussion of a point of logic we have not yet mentioned—the axiom of choice.


Theorem 9.1. Let A be a set. The following statements about A are equivalent:

(1) There exists an injective function f : Z+ → A.

(2) There exists a bijection of A with a proper subset of itself.

(3) A is infinite.

Proof. We prove the implications (1) ⇒ (2) ⇒ (3) ⇒ (1). To prove that (1) ⇒ (2),
we assume there is an injective function f : Z+ → A. Let the image set f(Z+) be
denoted by B; and let f(n) be denoted by a_n. Because f is injective, a_n ≠ a_m if
n ≠ m. Define

g : A → A - {a_1}

by the equations

g(a_n) = a_{n+1}   for a_n ∈ B,
g(x) = x           for x ∈ A - B.

The map g is indicated schematically in Figure 9.1; one checks easily that it is a
bijection.


Figure 9.1 
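
The shift map g can be made concrete in a small model. The Python sketch below assumes, purely for illustration, that A is the set of nonnegative integers and that f(n) = 2n, so that B is the set of positive even integers and a_1 = 2; the names in_B, g_inverse, and window are hypothetical. It checks on a finite window that g is injective, that a_1 is not in the image, and that every other point has a preimage.

```python
def f(n: int) -> int:
    """Injective map Z_+ -> A, with A modeled as the nonnegative integers."""
    return 2 * n


def in_B(x: int) -> bool:
    """B = f(Z_+) is the set of positive even integers."""
    return x >= 2 and x % 2 == 0


def g(x: int) -> int:
    """g(a_n) = a_{n+1} on B, and g(x) = x on A - B."""
    return x + 2 if in_B(x) else x


def g_inverse(y: int) -> int:
    """Inverse of g on A - {a_1}; the point a_1 = f(1) = 2 has no preimage."""
    if y == f(1):
        raise ValueError("a_1 is not in the image of g")
    return y - 2 if in_B(y) else y


window = range(0, 50)
images = [g(x) for x in window]
```

A finite check of this kind is of course no substitute for the argument in the text; it only shows the pattern of Figure 9.1 in one example.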


The implication (2) ⇒ (3) is just the contrapositive of Corollary 6.3, so it has
already been proved. To prove that (3) ⇒ (1), we assume that A is infinite and
construct “by induction” an injective function f : Z_+ → A.

First, since the set A is not empty, we can choose a point a_1 of A; define f(1) to
be the point so chosen.

Then, assuming that we have defined f(1), ..., f(n − 1), we wish to define f(n).
The set A − f({1, ..., n − 1}) is not empty; for if it were empty, the map
f : {1, ..., n − 1} → A would be a surjection and A would be finite. Hence, we can
choose an element of the set A − f({1, ..., n − 1}) and define f(n) to be this element.
“Using the induction principle”, we have defined f for all n ∈ Z_+.

It is easy to see that f is injective. For suppose that m < n. Then f(m) belongs to
the set f({1, ..., n − 1}), whereas f(n), by definition, does not. Therefore,
f(n) ≠ f(m). ∎


Let us try to reformulate this “induction” proof more carefully, so as to make 
explicit our use of the principle of recursive definition. 

Given the infinite set A, we attempt to define f : Z_+ → A recursively by the
formula

\[
\begin{aligned}
f(1) &= a_1,\\
f(i) &= \text{an arbitrary element of } [A - f(\{1,\dots,i-1\})] \quad \text{for } i > 1.
\end{aligned}
\tag{$*$}
\]

But this is not an acceptable recursion formula at all! For it does not define f(i)
uniquely in terms of f|{1, ..., i − 1}.

In this respect this formula differs notably from the recursion formula we considered
in proving Lemma 7.2. There we had an infinite subset C of Z_+, and we defined h
by the formula

\[
\begin{aligned}
h(1) &= \text{smallest element of } C,\\
h(i) &= \text{smallest element of } [C - h(\{1,\dots,i-1\})] \quad \text{for } i > 1.
\end{aligned}
\]

This formula does define h(i) uniquely in terms of h|{1, ..., i − 1}.
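
Because the formula for h involves no arbitrary choices, it can be carried out mechanically. The following Python sketch assumes that the infinite set C is given by a membership test in_C on the positive integers; the function name enumerate_subset and the sample set (the multiples of 3) are hypothetical.

```python
from itertools import count
from typing import Callable, List


def enumerate_subset(in_C: Callable[[int], bool], n: int) -> List[int]:
    """Return [h(1), ..., h(n)], where h(i) is the smallest element of
    C - h({1, ..., i-1}) and C = {k in Z_+ : in_C(k)} is assumed infinite."""
    chosen: List[int] = []
    for _ in range(n):
        # Smallest positive integer of C not already used. The search is
        # guaranteed to stop only because C is assumed to be infinite.
        chosen.append(next(k for k in count(1) if in_C(k) and k not in chosen))
    return chosen


multiples_of_three = enumerate_subset(lambda k: k % 3 == 0, 5)
```

At every step the value is forced, so two runs of the procedure can never disagree; that is the uniqueness the recursion principle needs, and it is exactly what formula (*) lacks.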

Another way of seeing that (*) is not an acceptable recursion formula is to note
that if it were, the principle of recursive definition would imply that there is a unique
function f : Z_+ → A satisfying (*). But by no stretch of the imagination does (*)
specify f uniquely. In fact, this “definition” of f involves infinitely many arbitrary
choices.

What we are saying is that the proof we have given for Theorem 9.1 is not actually 
a proof. Indeed, on the basis of the properties of set theory we have discussed up to 
now, it is not possible to prove this theorem. Something more is needed. 

Previously, we described certain definite allowable methods for specifying sets: 

(1) Defining a set by listing its elements, or by taking a given set A and specifying a 
subset B of it by giving a property that the elements of B are to satisfy. 

(2) Taking unions or intersections of the elements of a given collection of sets, or 
taking the difference of two sets. 


(3) Taking the set of all subsets of a given set. 


(4) Taking cartesian products of sets. 

Now the rule for the function f is really a set: a subset of Z_+ × A. Therefore, to prove
the existence of the function f, we must construct the appropriate subset of Z_+ × A,
using the allowed methods for forming sets. The methods already given simply are not
adequate for this purpose. We need a new way of asserting the existence of a set. So,
we add to the list of allowed methods of forming sets the following:


Axiom of choice. Given a collection 𝒜 of disjoint nonempty sets, there exists a set C
consisting of exactly one element from each element of 𝒜; that is, a set C such that C
is contained in the union of the elements of 𝒜, and for each A ∈ 𝒜, the set C ∩ A
contains a single element.


The set C can be thought of as having been obtained by choosing one element
from each of the sets in 𝒜.

The axiom of choice certainly seems an innocent-enough assertion. And, in fact, 
most mathematicians today accept it as part of the set theory on which they base their 
mathematics. But in years past a good deal of controversy raged around this particular 
assertion concerning set theory, for there are theorems one can prove with its aid that 
some mathematicians were reluctant to accept. One such is the well-ordering theorem, 
which we shall discuss shortly. For the present we shall simply use the choice axiom 
to clear up the difficulty we mentioned in the preceding proof. First, we prove an easy 
consequence of the axiom of choice: 


Lemma 9.2 (Existence of a choice function). Given a collection ℬ of nonempty
sets (not necessarily disjoint), there exists a function

\[ c : \mathcal{B} \to \bigcup_{B \in \mathcal{B}} B \]

such that c(B) is an element of B, for each B ∈ ℬ.


The function c is called a choice function for the collection ℬ.

The difference between this lemma and the axiom of choice is that in this lemma
the sets of the collection ℬ are not required to be disjoint. For example, one can
allow ℬ to be the collection of all nonempty subsets of a given set.


Proof of the lemma. Given an element B of ℬ, we define a set B′ as follows:

\[ B' = \{(B, x) \mid x \in B\}. \]

That is, B′ is the collection of all ordered pairs, where the first coordinate of the ordered
pair is the set B, and the second coordinate is an element of B. The set B′ is a subset
of the cartesian product

\[ \mathcal{B} \times \bigcup_{B \in \mathcal{B}} B. \]

Because B contains at least one element x, the set B′ contains at least the element
(B, x), so it is nonempty.

Now we claim that if B_1 and B_2 are two different sets in ℬ, then the corresponding
sets B_1′ and B_2′ are disjoint. For the typical element of B_1′ is a pair of the form (B_1, x_1)
and the typical element of B_2′ is a pair of the form (B_2, x_2). No two such elements can
be equal, for their first coordinates are different. Now let us form the collection

\[ \mathcal{C} = \{B' \mid B \in \mathcal{B}\}; \]

it is a collection of disjoint nonempty subsets of

\[ \mathcal{B} \times \bigcup_{B \in \mathcal{B}} B. \]

By the choice axiom, there exists a set c consisting of exactly one element from each
element of 𝒞. Our claim is that c is the rule for the desired choice function.

In the first place, c is a subset of

\[ \mathcal{B} \times \bigcup_{B \in \mathcal{B}} B. \]

In the second place, c contains exactly one element from each set B′; therefore, for
each B ∈ ℬ, the set c contains exactly one ordered pair (B, x) whose first coordinate
is B. Thus c is indeed the rule for a function from the collection ℬ to the set ⋃_{B∈ℬ} B.
Finally, if (B, x) ∈ c, then x belongs to B, so that c(B) ∈ B, as desired.
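
The tagging device of this proof can be followed step by step on a small finite collection. The Python sketch below is only an illustration: for finite sets no choice axiom is needed, so the hypothetical helper choice_set simply stands in for the axiom by picking the pair with the least second coordinate. The names tag, collection, tagged, c_rule, and c are likewise assumptions of the sketch.

```python
from typing import Dict, FrozenSet, Set, Tuple

Tagged = Tuple[FrozenSet[int], int]


def tag(B: FrozenSet[int]) -> FrozenSet[Tagged]:
    """B' = {(B, x) | x in B}."""
    return frozenset((B, x) for x in B)


def choice_set(disjoint: Set[FrozenSet[Tagged]]) -> Set[Tagged]:
    """Stand-in for the choice axiom: exactly one element from each set of a
    disjoint collection. For finite sets of pairs we may take the pair with
    the least second coordinate (an assumption of this sketch)."""
    return {min(Bp, key=lambda pair: pair[1]) for Bp in disjoint}


# The sets of the collection overlap, so the axiom does not apply directly.
collection = {frozenset({1, 2}), frozenset({2, 3}), frozenset({1, 3})}
tagged = {tag(B) for B in collection}         # the collection C of the proof
c_rule = choice_set(tagged)                   # one pair (B, x) for each B
c: Dict[FrozenSet[int], int] = dict(c_rule)   # the rule read as a function
```

The tagged sets are pairwise disjoint even though the original sets are not, and reading the chosen pairs as a rule gives a function c with c(B) in B for every B.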


A second proof of Theorem 9.1. Using this lemma, one can make the proof of
Theorem 9.1 more precise. Given the infinite set A, we wish to construct an injective
function f : Z_+ → A. Let us form the collection ℬ of all nonempty subsets of A. The
lemma just proved asserts the existence of a choice function for ℬ; that is, a function

\[ c : \mathcal{B} \to \bigcup_{B \in \mathcal{B}} B = A \]

such that c(B) ∈ B for each B ∈ ℬ. Let us now define a function f : Z_+ → A by the
recursion formula

\[
\begin{aligned}
f(1) &= c(A),\\
f(i) &= c(A - f(\{1,\dots,i-1\})) \quad \text{for } i > 1.
\end{aligned}
\tag{$*$}
\]

Because A is infinite, the set A − f({1, ..., i − 1}) is nonempty; therefore, the right
side of this equation makes sense. Since this formula defines f(i) uniquely in terms of
f|{1, ..., i − 1}, the principle of recursive definition applies. We conclude that there
exists a unique function f : Z_+ → A satisfying (*) for all i ∈ Z_+. Injectivity of f
follows as before. ∎
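
Once a choice function is fixed, the recursion becomes deterministic and can be computed. The Python sketch below assumes, for illustration only, that A is the set of all integers, for which an explicit choice function exists without any appeal to the choice axiom. The function c is modeled only on the sets of the form A minus a finite set, which are the only ones the recursion uses; the names integers, c, and f_values are hypothetical.

```python
from itertools import count
from typing import FrozenSet, List


def integers():
    """Enumerate A = Z as 0, 1, -1, 2, -2, ... (the model set of this sketch)."""
    yield 0
    for k in count(1):
        yield k
        yield -k


def c(excluded: FrozenSet[int]) -> int:
    """Choice function evaluated on the nonempty set A - excluded: return the
    first element of the enumeration that is not excluded."""
    return next(x for x in integers() if x not in excluded)


def f_values(n: int) -> List[int]:
    """[f(1), ..., f(n)] with f(1) = c(A) and f(i) = c(A - f({1, ..., i-1}))."""
    values: List[int] = []
    for _ in range(n):
        values.append(c(frozenset(values)))
    return values


first_seven = f_values(7)
```

For a general infinite set A there is no such explicit rule, and that is precisely where Lemma 9.2 is needed.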


Having emphasized that in order to construct a proof of Theorem 9.1 that is logi- 
cally correct, one must make specific use of a choice function, we now backtrack and 
admit that in practice most mathematicians do no such thing. They go on with no 
qualms giving proofs like our first version, proofs that involve an infinite number of 
arbitrary choices. They know that they are really using the choice axiom; and they 
know that if it were necessary, they could put their proofs into a logically more sat- 
isfactory form by introducing a choice function specifically. But usually they do not 
bother. 

And neither will we. You will find few further specific uses of a choice function
in this book; we shall introduce a choice function only when the proof would become
confusing without it. But there will be many proofs in which we make an infinite
number of arbitrary choices, and in each such case we will actually be using the choice
axiom implicitly.

Now we must confess that in an earlier section of this book there is a proof in 
which we constructed a certain function by making an infinite number of arbitrary 
choices. And we slipped that proof in without even mentioning the choice axiom. Our 
apologies for the deception. We leave it to you to ferret out which proof it was! 

Let us make one final comment on the choice axiom. There are two forms of
this axiom. One can be called the finite axiom of choice; it asserts that given a finite
collection 𝒜 of disjoint nonempty sets, there exists a set C consisting of exactly one
element from each element of 𝒜. One needs this weak form of the choice axiom
all the time; we have used it freely in the preceding sections with no comment. No
mathematician has any qualms about the finite choice axiom; it is part of everyone’s
set theory. Said differently, no one has qualms about a proof that involves only finitely
many arbitrary choices.

The stronger form of the axiom of choice, the one that applies to an arbitrary
collection 𝒜 of nonempty sets, is the one that is properly called “the axiom of choice.”
When a mathematician writes, “This proof depends on the choice axiom,” it is
invariably this stronger form of the axiom that is meant.


Exercises 


1. Define an injective map f : Z_+ → X^ω, where X is the two-element set {0, 1},
without using the choice axiom.


2. Find if possible a choice function for each of the following collections, without
using the choice axiom:
(a) The collection 𝒜 of nonempty subsets of Z_+.
(b) The collection ℬ of nonempty subsets of Z.
(c) The collection 𝒞 of nonempty subsets of the rational numbers Q.
(d) The collection 𝒟 of nonempty subsets of X^ω, where X = {0, 1}.


3. Suppose that A is a set and {f_n}_{n∈Z_+} is a given indexed family of injective
functions

\[ f_n : \{1, \dots, n\} \to A. \]

Show that A is infinite. Can you define an injective function f : Z_+ → A
without using the choice axiom?


4. There was a theorem in §7 whose proof involved an infinite number of arbitrary
choices. Which one was it? Rewrite the proof so as to make explicit the use of
the choice axiom. (Several of the earlier exercises have used the choice axiom
also.)
5. (a) Use the choice axiom to show that if f : A → B is surjective, then f has a
right inverse h : B → A.

(b) Show that if f : A → B is injective and A is not empty, then f has a left
inverse. Is the axiom of choice needed?


6. Most of the famous paradoxes of naive set theory are associated in some way or
other with the concept of the “set of all sets.” None of the rules we have given for
forming sets allows us to consider such a set. And for good reason: the concept
itself is self-contradictory. For suppose that 𝒜 denotes the “set of all sets.”

(a) Show that 𝒫(𝒜) ⊂ 𝒜; derive a contradiction.

(b) (Russell's paradox.) Let ℬ be the subset of 𝒜 consisting of all sets that are
not elements of themselves;

\[ \mathcal{B} = \{A \mid A \in \mathcal{A} \text{ and } A \notin A\}. \]

(Of course, there may be no set A such that A ∈ A; if such is the case, then
ℬ = 𝒜.) Is ℬ an element of itself or not?


7. Let A and B be two nonempty sets. If there is an injection of B into A, but no
injection of A into B, we say that A has greater cardinality than B.

(a) Conclude from Theorem 9.1 that every uncountable set has greater cardinality
than Z_+.

(b) Show that if A has greater cardinality than B, and B has greater cardinality
than C, then A has greater cardinality than C.

(c) Find a sequence A_1, A_2, ... of infinite sets, such that for each n ∈ Z_+, the
set A_{n+1} has greater cardinality than A_n.

(d) Find a set that for every n has cardinality greater than A_n.


*8. Show that 𝒫(Z_+) and R have the same cardinality. [Hint: You may use the fact
that every real number has a decimal expansion, which is unique if expansions
that end in an infinite string of 9’s are forbidden.]

A famous conjecture of set theory, called the continuum hypothesis, asserts
that there exists no set having greater cardinality than Z_+ and lesser cardinality
than R. The generalized continuum hypothesis asserts that, given the infinite
set A, there is no set having greater cardinality than A and lesser cardinality
than 𝒫(A). Surprisingly enough, both of these assertions have been shown to
be independent of the usual axioms for set theory. For a readable expository
account, see [Sm].


§10 Well-Ordered Sets 


One of the useful properties of the set Z_+ of positive integers is the fact that each of
its nonempty subsets has a smallest element. Generalizing this property leads to the
concept of a well-ordered set.

Definition. A set A with an order relation < is said to be well-ordered if every
nonempty subset of A has a smallest element.


EXAMPLE 1. Consider the set {1, 2} × Z_+ in the dictionary ordering. Schematically, it
can be represented as one infinite sequence followed by another infinite sequence:

\[ a_1, a_2, a_3, \dots;\ b_1, b_2, b_3, \dots \]

with the understanding that each element is less than every element to the right of it. It is
not difficult to see that every nonempty subset C of this ordered set has a smallest element:
If C contains any one of the elements a_n, we simply take the smallest element of the
intersection of C with the sequence a_1, a_2, ...; while if C contains no a_n, then it is a
subset of the sequence b_1, b_2, ... and as such has a smallest element.


EXAMPLE 2. Consider the set Z_+ × Z_+ in the dictionary order. Schematically, it can be
represented as an infinite sequence of infinite sequences. We show that it is well-ordered.
Let X be a nonempty subset of Z_+ × Z_+. Let A be the subset of Z_+ consisting of all first
coordinates of elements of X. Now A has a smallest element; call it a_0. Then the collection

\[ \{b \mid a_0 \times b \in X\} \]

is a nonempty subset of Z_+; let b_0 be its smallest element. By definition of the dictionary
order, a_0 × b_0 is the smallest element of X. See Figure 10.1.


Figure 10.1
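
The two-step search of Example 2 can be written out directly. The Python sketch below works with a finite sample X of Z_+ × Z_+ (an assumption made so that the computation terminates; the argument in the text covers arbitrary nonempty subsets). The names smallest, X, and least are hypothetical.

```python
from typing import Set, Tuple

Pair = Tuple[int, int]


def smallest(X: Set[Pair]) -> Pair:
    """Smallest element of a nonempty finite subset X of Z_+ x Z_+ in the
    dictionary order, found the way Example 2 finds it."""
    a0 = min(a for a, _ in X)               # smallest first coordinate
    b0 = min(b for a, b in X if a == a0)    # smallest b with (a0, b) in X
    return (a0, b0)


X = {(3, 1), (2, 7), (2, 5), (4, 2)}
least = smallest(X)
```

Python compares tuples in the dictionary order, so the result agrees with the built-in min(X).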


EXAMPLE 3. The set of integers is not well-ordered in the usual order; the subset
consisting of the negative integers has no smallest element. Nor is the set of real numbers in
the interval 0 ≤ x ≤ 1 well-ordered; the subset consisting of those x for which 0 < x < 1
has no smallest element (although it has a greatest lower bound, of course).


There are several ways of constructing well-ordered sets. Two of them are the
following:
(1) If A is a well-ordered set, then any subset of A is well-ordered in the restricted
order relation.
(2) If A and B are well-ordered sets, then A × B is well-ordered in the dictionary
order.
The proof of (1) is trivial; the proof of (2) follows the pattern given in Example 2.

It follows that the set Z_+ × (Z_+ × Z_+) is well-ordered in the dictionary order; it
can be represented as an infinite sequence of infinite sequences of infinite sequences.
Similarly, (Z_+)^4 is well-ordered in the dictionary order. And so on. But if you try to
generalize to an infinite product of Z_+ with itself, you will run into trouble. We shall
examine this situation shortly.

Now, given a set A without an order relation, it is natural to ask whether there
exists an order relation for A that makes it into a well-ordered set. If A is finite, any
bijection

\[ f : A \to \{1, \dots, n\} \]

can be used to define an order relation on A; under this relation, A has the same order
type as the ordered set {1, ..., n}. In fact, every order relation on a finite set can be
obtained in this way:


Theorem 10.1. Every nonempty finite ordered set has the order type of a section
{1, ..., n} of Z_+, so it is well-ordered.


Proof. This was given as an exercise in §6; we prove it here. First, we show that
every finite ordered set A has a largest element. If A has one element, this is trivial.
Supposing it true for sets having n − 1 elements, let A have n elements and let a_0 ∈ A.
Then A − {a_0} has a largest element a_1, and the larger of {a_0, a_1} is the largest element
of A.

Second, we show there is an order-preserving bijection of A with {1, ..., n} for
some n. If A has one element, this fact is trivial. Suppose that it is true for sets
having n − 1 elements. Let b be the largest element of A. By hypothesis, there is an
order-preserving bijection

\[ f' : A - \{b\} \to \{1, \dots, n - 1\}. \]

Define an order-preserving bijection f : A → {1, ..., n} by setting

\[
\begin{aligned}
f(x) &= f'(x) \quad \text{for } x \neq b,\\
f(b) &= n.
\end{aligned}
\]
∎
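
The inductive construction in this proof translates into a short recursive procedure. The Python sketch below assumes that the finite set is given as a list of distinct elements together with a function less implementing a simple order on it; the names order_isomorphism, words, and f are hypothetical, and the empty base case is included only for convenience.

```python
from typing import Callable, Dict, List, TypeVar

T = TypeVar("T")


def order_isomorphism(A: List[T], less: Callable[[T, T], bool]) -> Dict[T, int]:
    """Order-preserving bijection from a finite simply ordered set A onto
    {1, ..., n}, built as in the proof: send the largest element b to n and
    recurse on A - {b}."""
    if not A:
        return {}
    b = A[0]
    for x in A[1:]:
        if less(b, x):      # keep the larger of the two
            b = x
    rest = [x for x in A if x != b]
    f = order_isomorphism(rest, less)
    f[b] = len(A)
    return f


# Reverse alphabetical order, to stress that any simple order will do.
words = ["pear", "fig", "banana", "kiwi"]
f = order_isomorphism(words, lambda x, y: x > y)
```

Under reverse alphabetical order "banana" is the largest of the four words, so it is sent to 4, and the remaining words follow in turn.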


Thus, a finite ordered set has only one possible order type. For an infinite set,
things are quite different. The well-ordered sets

\[
\begin{gathered}
Z_+,\\
\{1, \dots, n\} \times Z_+,\\
Z_+ \times Z_+,\\
Z_+ \times (Z_+ \times Z_+)
\end{gathered}
\]

are all countably infinite, but they all have different order types, as you can check.

All the examples we have given of well-ordered sets are orderings of countable
sets. It is natural to ask whether one can find a well-ordered uncountable set.

The obvious uncountable set to try is the countably infinite product

\[ X = Z_+ \times Z_+ \times \cdots = (Z_+)^{\omega} \]

of Z_+ with itself. One can generalize the dictionary order to this set in a natural way,
by defining

\[ (a_1, a_2, \dots) < (b_1, b_2, \dots) \]

if for some n ≥ 1,

\[ a_i = b_i \ \text{for } i < n \quad \text{and} \quad a_n < b_n. \]

This is, in fact, an order relation on the set X; but unfortunately it is not a well-ordering. 
Consider the set A of all elements x of X of the form 


x=(1,...,1,2,1,1,...), 


where exactly one coordinate of x equals 2, and the others are all equal to 1. The set A 
clearly has no smallest element. 
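
To see why A has no smallest element, let x_n denote the element of A whose n-th coordinate is 2. Then x_{n+1} < x_n for every n, since the two sequences agree in coordinates 1, ..., n − 1 and differ first in coordinate n, where x_{n+1} has 1 and x_n has 2. The Python sketch below checks this on finite prefixes; the names element, less, and descending are hypothetical.

```python
from typing import List


def element(n: int, length: int) -> List[int]:
    """First `length` coordinates of the sequence whose n-th coordinate is 2
    and whose other coordinates are 1 (coordinates are numbered from 1)."""
    return [2 if i == n else 1 for i in range(1, length + 1)]


def less(m: int, n: int) -> bool:
    """Dictionary-order comparison x_m < x_n. The two sequences agree beyond
    coordinate max(m, n), so a prefix of that length decides the comparison."""
    length = max(m, n)
    # Python lists compare lexicographically, i.e. in the dictionary order.
    return element(m, length) < element(n, length)


descending = all(less(n + 1, n) for n in range(1, 200))
```

The elements x_1 > x_2 > x_3 > ··· form an infinite strictly decreasing sequence that exhausts A, so no element of A can be the smallest.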

Thus, the dictionary order at least does not give a well-ordering of the set (Z_+)^ω.
Is there some other order relation on this set that is a well-ordering? No one has ever
constructed a specific well-ordering of (Z_+)^ω. Nevertheless, there is a famous theorem
that says such a well-ordering exists:


Theorem (Well-ordering theorem). If A is a set, there exists an order relation on
A that is a well-ordering.


This theorem was proved by Zermelo in 1904, and it startled the mathematical
world. There was considerable debate as to the correctness of the proof; the lack of
any constructive procedure for well-ordering an arbitrary uncountable set led many to
be skeptical. When the proof was analyzed closely, the only point at which it was found
that there might be some question was a construction involving an infinite number of
arbitrary choices, that is, a construction involving the choice axiom.

Some mathematicians rejected the choice axiom as a result, and for many years a 
legitimate question about a new theorem was: Does its proof involve the choice axiom 
or not? A theorem was considered to be on somewhat shaky ground if one had to use 
the choice axiom in its proof. Present-day mathematicians, by and large, do not have 
such qualms. They accept the axiom of choice as a reasonable assumption about set 
theory, and they accept the well-ordering theorem along with it. 

The proof that the choice axiom implies the well-ordering theorem is rather long
(although not exceedingly difficult) and primarily of interest to logicians; we shall omit
it. If you are interested, a proof is outlined in the supplementary exercises at the end
of the chapter. Instead, we shall simply assume the well-ordering theorem whenever
we need it. Consider it to be an additional axiom of set theory if you like!

We shall in fact need the full strength of this assumption only occasionally. Most 
of the time, all we need is the following weaker result: 


Corollary. There exists an uncountable well-ordered set. 


We now use this result to construct a particular well-ordered set that will prove to 
be very useful. 


Definition. Let X be a well-ordered set. Given α ∈ X, let S_α denote the set

\[ S_\alpha = \{x \mid x \in X \text{ and } x < \alpha\}. \]

It is called the section of X by α.


Lemma 10.2. There exists a well-ordered set A having a largest element Ω, such that
the section S_Ω of A by Ω is uncountable but every other section of A is countable.


Proof. We begin with an uncountable well-ordered set B. Let C be the well-ordered
set {1, 2} × B in the dictionary order; then some section of C is uncountable. (Indeed,
the section of C by any element of the form 2 × b is uncountable.) Let Ω be the
smallest element of C for which the section of C by Ω is uncountable. Then let A
consist of this section along with the element Ω. ∎


Note that S_Ω is an uncountable well-ordered set every section of which is countable.
Its order type is in fact uniquely determined by this condition. We shall call it a
minimal uncountable well-ordered set. Furthermore, we shall denote the well-ordered
set A = S_Ω ∪ {Ω} by the symbol S̄_Ω (for reasons to be seen later).

The most useful property of the set S_Ω for our purposes is expressed in the following
theorem:


Theorem 10.3. If A is a countable subset of S_Ω, then A has an upper bound in S_Ω.


Proof. Let A be a countable subset of S_Ω. For each a ∈ A, the section S_a is countable.
Therefore, the union B = ⋃_{a∈A} S_a is also countable. Since S_Ω is uncountable,
the set B is not all of S_Ω; let x be a point of S_Ω that is not in B. Then x is an upper
bound for A. For if x < a for some a in A, then x belongs to S_a and hence to B,
contrary to choice. ∎


Exercises 


1. Show that every well-ordered set has the least upper bound property. 


2. (a) Show that in a well-ordered set, every element except the largest (if one
exists) has an immediate successor.
(b) Find a set in which every element has an immediate successor that is not
well-ordered.


3. Both {1, 2} × Z_+ and Z_+ × {1, 2} are well-ordered in the dictionary order. Do
they have the same order type?


4. (a) Let Z_− denote the set of negative integers in the usual order. Show that
a simply ordered set A fails to be well-ordered if and only if it contains a
subset having the same order type as Z_−.

(b) Show that if A is simply ordered and every countable subset of A is well-ordered,
then A is well-ordered.


5. Show the well-ordering theorem implies the choice axiom.


6. Let S_Ω be the minimal uncountable well-ordered set.

(a) Show that S_Ω has no largest element.

(b) Show that for every α ∈ S_Ω, the subset {x | α < x} is uncountable.

(c) Let X_0 be the subset of S_Ω consisting of all elements x such that x has no
immediate predecessor. Show that X_0 is uncountable.


7. Let J be a well-ordered set. A subset J_0 of J is said to be inductive if for every
α ∈ J,

\[ (S_\alpha \subset J_0) \Longrightarrow \alpha \in J_0. \]

Theorem (The principle of transfinite induction). If J is a well-ordered set
and J_0 is an inductive subset of J, then J_0 = J.


8. (a) Let A_1 and A_2 be disjoint sets, well-ordered by <_1 and <_2, respectively.
Define an order relation on A_1 ∪ A_2 by letting a < b either if a, b ∈ A_1 and
a <_1 b, or if a, b ∈ A_2 and a <_2 b, or if a ∈ A_1 and b ∈ A_2. Show that this
is a well-ordering.

(b) Generalize (a) to an arbitrary family of disjoint well-ordered sets, indexed
by a well-ordered set.


9. Consider the subset A of (Z_+)^ω consisting of all infinite sequences of positive
integers x = (x_1, x_2, ...) that end in an infinite string of 1's. Give A the following
order: x < y if x_n < y_n and x_i = y_i for i > n. We call this the “antidictionary
order” on A.

(a) Show that for every n, there is a section of A that has the same order type as
(Z_+)^n in the dictionary order.

(b) Show A is well-ordered.


10. Theorem. Let J and C be well-ordered sets; assume that there is no surjective
function mapping a section of J onto C. Then there exists a unique function
h : J → C satisfying the equation

\[ h(x) = \text{smallest } [C - h(S_x)] \tag{$*$} \]

for each x ∈ J, where S_x is the section of J by x.
Proof.

(a) If h and k map sections of J, or all of J, into C and satisfy (*) for all x in
their respective domains, show that h(x) = k(x) for all x in both domains.

(b) If there exists a function h : S_α → C satisfying (*), show that there exists a
function k : S_α ∪ {α} → C satisfying (*).

(c) If K ⊂ J and for all α ∈ K there exists a function h_α : S_α → C satisfying
(*), show that there exists a function

\[ k : \bigcup_{\alpha \in K} S_\alpha \to C \]

satisfying (*).
(d) Show by transfinite induction that for every β ∈ J, there exists a function
h_β : S_β → C satisfying (*). [Hint: If β has an immediate predecessor α,
then S_β = S_α ∪ {α}. If not, S_β is the union of all S_α with α < β.]
(e) Prove the theorem.
11. Let A and B be two sets. Using the well-ordering theorem, prove that either they
have the same cardinality, or one has cardinality greater than the other. [Hint: If
there is no surjection f : A → B, apply the preceding exercise.]


*§11 The Maximum Principle†


We have already indicated that the axiom of choice leads to the deep theorem that every
set can be well-ordered. The axiom of choice has other consequences that are even
more important in mathematics. Collectively referred to as “maximum principles,”
they come in many versions. Formulated independently by a number of mathematicians,
including F. Hausdorff, K. Kuratowski, S. Bochner, and M. Zorn, during the
years 1914-1935, they were typically proved as consequences of the well-ordering
theorem. Later, it was realized that they were in fact equivalent to the well-ordering
theorem. We consider several of them here.
First, we make a definition. Given a set A, a relation ≺ on A is called a strict
partial order on A if it has the following two properties:
(1) (Nonreflexivity) The relation a ≺ a never holds.
(2) (Transitivity) If a ≺ b and b ≺ c, then a ≺ c.
These are just the second and third of the properties of a simple order (see §3); the
comparability property is the one that is omitted. In other words, a strict partial order
behaves just like a simple order except that it need not be true that for every pair of
distinct points x and y in the set, either x ≺ y or y ≺ x.
If ≺ is a strict partial order on a set A, it can easily happen that some subset B
of A is simply ordered by the relation; all that is needed is for every pair of elements
of B to be comparable under ≺.


† This section will be assumed in Chapters 5 and 14.

Now we can state the following principle, which was first formulated by Hausdorff
in 1914.


Theorem (The maximum principle). Let A be a set; let ≺ be a strict partial order
on A. Then there exists a maximal simply ordered subset B of A.


Said differently, there exists a subset B of A such that B is simply ordered by ≺
and such that no subset of A that properly contains B is simply ordered by ≺.


EXAMPLE 1. If 𝒜 is any collection of sets, the relation “is a proper subset of” is a
strict partial order on 𝒜. Suppose that 𝒜 is the collection of all circular regions (interiors
of circles) in the plane. One maximal simply ordered subcollection of 𝒜 consists of all
circular regions with centers at the origin. Another maximal simply ordered subcollection
consists of all circular regions bounded by circles tangent from the right to the y-axis at the
origin. See Figure 11.1.


Figure 11.1


EXAMPLE 2. If (x_0, y_0) and (x_1, y_1) are two points of the plane R^2, define

\[ (x_0, y_0) \prec (x_1, y_1) \]

if y_0 = y_1 and x_0 < x_1. This is a partial ordering of R^2 under which two points are
comparable only if they lie on the same horizontal line. The maximal simply ordered sets
are the horizontal lines in R^2.


One can give an intuitive “proof” of the maximum principle that is rather appealing.
It involves a step-by-step procedure, which one can describe in physical terms as
follows. Suppose we take a box, and put into it some of the elements of A according
to the following plan: First we pick an arbitrary element of A and put it in the box.
Then we pick another element of A. If it is comparable with the element in the box,
we put it in the box too; otherwise, we throw it away. At the general step, we will have
a collection of elements in the box and a collection of elements that have been tossed
away. Take one of the remaining elements of A. If it is comparable with everything
in the box, toss it in the box, too; otherwise, throw it away. Similarly continue. After
you have checked all the elements of A, the elements you have in the box will be
comparable with one another, and thus they will form a simply ordered set. Every element
not in the box will be noncomparable with at least one element in the box, for that was
why it was tossed away. Hence, the simply ordered set in the box is maximal, for no
larger subset of A can satisfy the comparability condition.

Now of course the weak point in the preceding “proof” comes when we said,
“After you have checked all the elements of A.” How do you know you ever “get
through” checking all the elements of A? If A should happen to be countable, it is not
hard to make this intuitive proof into a real proof. Let us take the countably infinite
case; the finite case is even easier. Index the elements of A bijectively with the positive
integers, so that A = {a_1, a_2, ...}. This indexing gives a way of deciding what order
to test the elements of A in, and how to know when one has tested them all.

Now we define a function h : Z_+ → {0, 1}, by letting it assign the value 0 to
i if we “put a_i in the box,” and the value 1 if we “throw a_i away.” This means that
h(1) = 0, and for i > 1, we have h(i) = 0 if and only if a_i is comparable with every
element of the set

\[ \{a_j \mid j < i \text{ and } h(j) = 0\}. \]

By the principle of recursive definition, this formula determines a unique function
h : Z_+ → {0, 1}. It is easy to check that the set of those a_j for which h(j) = 0 is a
maximal simply ordered subset of A.
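
The function h can be computed directly when A is finite. The Python sketch below assumes, for illustration, that A is a short list of distinct positive integers and that the strict partial order is proper divisibility; the names box_indicator, properly_divides, a, h, and chain are hypothetical.

```python
from typing import Callable, List


def box_indicator(a: List[int], prec: Callable[[int, int], bool]) -> List[int]:
    """h[i] = 0 if a[i] is put in the box, and 1 if it is thrown away."""
    h: List[int] = []
    for i, x in enumerate(a):
        # Elements already in the box: those a[j] with j < i and h[j] == 0.
        boxed = [a[j] for j in range(i) if h[j] == 0]
        comparable = all(prec(x, y) or prec(y, x) for y in boxed)
        h.append(0 if comparable else 1)
    return h


def properly_divides(x: int, y: int) -> bool:
    """Strict partial order on positive integers: x divides y and x != y."""
    return x != y and y % x == 0


a = [2, 3, 4, 5, 6, 8, 12, 16, 24]
h = box_indicator(a, properly_divides)
chain = [x for x, flag in zip(a, h) if flag == 0]
```

Here the box ends up containing 2, 4, 8, and 16, and each discarded element fails to be comparable with at least one of these, which is what maximality requires.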

If A is not countable, a variant of this procedure will work, if we allow ourselves to
use the well-ordering theorem. Instead of indexing the elements of A with the set Z_+,
we index them (in a bijective fashion) with the elements of some well-ordered set J, so
that A = {a_α | α ∈ J}. For this we need the well-ordering theorem, so that we know
there is a bijection between A and some well-ordered set J. Then we can proceed as
in the previous paragraph, letting α replace i in the argument. Strictly speaking, you
need to generalize the principle of recursive definition to well-ordered sets as well, but
that is not particularly difficult. (See the Supplementary Exercises.)

Thus, the well-ordering theorem implies the maximum principle.

Although the maximum principle of Hausdorff was the first to be formulated and
is probably the simplest to understand, there is another such principle that is nowadays
the one most frequently quoted. It is popularly called “Zorn’s Lemma,” although
Kuratowski (1922) and Bochner (1922) preceded Zorn (1935) in enunciating and proving
versions of it. For a discussion of the tangled history of these ideas, see [C]
or [Mo]. To state this principle, we need some terminology.


Definition. Let A be a set and let < be a strict partial order on A. If B is a subset 
of A, an upper bound on B is an element c of A such that for every b in B, either 
b = c or b < c. A maximal element of A is an element m of A such that for no
element a of A does the relation m < a hold. 


Zorn’s Lemma. Let A be a set that is strictly partially ordered. If every simply
ordered subset of A has an upper bound in A, then A has a maximal element.

Zorn’s lemma is an easy consequence of the maximum principle: Given A, the
maximum principle implies that A has a maximal simply ordered subset B. The
hypothesis of Zorn’s lemma tells us that B has an upper bound c in A. The element c is
then automatically a maximal element of A. For if c < d for some element d of A,
then the set B ∪ {d}, which properly contains B, is simply ordered because b < d for
every b ∈ B. This fact contradicts maximality of B.
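
For a finite strictly partially ordered set, the notions in this argument can be checked directly. The sketch below is an illustration only; the helper `maximal_elements` and the sample order (proper divisibility on {2, ..., 12}) are assumptions made for the example, not part of the text.

```python
def maximal_elements(A, less):
    """Elements m of A for which m < a holds for no a in A."""
    return [m for m in A if not any(less(m, a) for a in A)]


def properly_divides(x, y):
    # Hypothetical strict partial order: x precedes y when x properly divides y.
    return x != y and y % x == 0


A = list(range(2, 13))
print(maximal_elements(A, properly_divides))   # [7, 8, 9, 10, 11, 12]

# {2, 4, 8} is a maximal simply ordered subset of A; its upper bound 8
# is a maximal element of A, as the argument above predicts.
chain = [2, 4, 8]
c = max(chain)
print(c in maximal_elements(A, properly_divides))   # True
```

Note that a maximal element need not be a largest element: here there are six maximal elements, and none of them is comparable with all of A.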

It is also true that the maximum principle is an easy consequence of Zorn’s lemma. 
See Exercises 5-7. 

One final remark. We have defined what we mean by a strict partial order on a set,
but we have not said what a partial order itself is. Let < be a strict partial order on a
set A. Suppose that we define a ≤ b if either a < b or a = b. Then the relation ≤ is
called a partial order on A. For example, the inclusion relation ⊂ on a collection of
sets is a partial order, whereas proper inclusion is a strict partial order.

Many authors prefer to deal with partial orderings rather than strict partial
orderings; the maximum principle and Zorn’s lemma are often expressed in these terms.
Which formulation is used is simply a matter of taste and convenience.


Exercises 


1. If a and b are real numbers, define a < b if b — a is positive and rational. Show 
this is a strict partial order on R. What are the maximal simply ordered subsets? 


2. (a) Let < be a strict partial order on the set A. Define a relation ≤ on A by letting
a ≤ b if either a < b or a = b. Show that this relation has the following
properties, which are called the partial order axioms:

(i) a ≤ a for all a ∈ A.
(ii) a ≤ b and b ≤ a ⟹ a = b.
(iii) a ≤ b and b ≤ c ⟹ a ≤ c.
(b) Let P be a relation on A that satisfies properties (i)–(iii). Define a relation S
on A by letting aSb if aPb and a ≠ b. Show that S is a strict partial order
on A.


3. Let A be a set with a strict partial order <; let x ∈ A. Suppose that we wish to
find a maximal simply ordered subset B of A that contains x. One plausible way
of attempting to define B is to let B equal the set of all those elements of A that
are comparable with x:


B = {y | y ∈ A and either x < y or y < x}.


But this will not always work. In which of Examples 1 and 2 will this procedure
succeed and in which will it not?


4. Given two points (x_0, y_0) and (x_1, y_1) of R², define


(x_0, y_0) ≺ (x_1, y_1)

if x_0 < x_1 and y_0 ≤ y_1. Show that the curves y = x³ and y = 2 are maximal
simply ordered subsets of R², and the curve y = x² is not. Find all maximal
simply ordered subsets.


5. Show that Zorn’s lemma implies the following:

Lemma (Kuratowski). Let A be a collection of sets. Suppose that for every 
subcollection B of A that is simply ordered by proper inclusion, the union of the 
elements of B belongs to A. Then A has an element that is properly contained
in no other element of A. 


6. A collection A of subsets of a set X is said to be of finite type provided that a 
subset B of X belongs to A if and only if every finite subset of B belongs to A. 
Show that the Kuratowski lemma implies the following: 

Lemma (Tukey, 1940). Let A be a collection of sets. If A is of finite type, then 
A has an element that is properly contained in no other element of A. 


7. Show that the Tukey lemma implies the Hausdorff maximum principle. [Hint:
If < is a strict partial order on A, let 𝒜 be the collection of all subsets of A that
are simply ordered by <. Show that 𝒜 is of finite type.]


8. A typical use of Zorn’s lemma in algebra is the proof that every vector space
has a basis. Recall that if A is a subset of the vector space V, we say a vector 
belongs to the span of A if it equals a finite linear combination of elements of A. 
The set A is independent if the only finite linear combination of elements of A 
that equals the zero vector is the trivial one having all coefficients zero. If A is 
independent and if every vector in V belongs to the span of A, then A is a basis 
for V. 

(a) If A is independent and v ∈ V does not belong to the span of A, show A ∪ {v}
is independent. 

(b) Show the collection of all independent sets in V has a maximal element. 

(c) Show that V has a basis. 


*Supplementary Exercises: Well-Ordering 


In the following exercises, we ask you to prove the equivalence of the choice axiom, 
the well-ordering theorem, and the maximum principle. We comment that of these 
exercises, only Exercise 7 uses the choice axiom. 

1. Theorem (General principle of recursive definition). Let J be a well-ordered
set; let C be a set. Let ℱ be the set of all functions mapping sections of J into C.
Given a function ρ : ℱ → C, there exists a unique function h : J → C such
that h(α) = ρ(h|S_α) for each α ∈ J.

[Hint: Follow the pattern outlined in Exercise 10 of §10.]


2. (a) Let J and E be well-ordered sets; let h : J → E. Show the following two
statements are equivalent:


(i) h is order preserving and its image is E or a section of E.
(ii) h(α) = smallest [E − h(S_α)] for all α.
[Hint: Show that each of these conditions implies that h(S_α) is a section of
E; conclude that it must be the section by h(α).]

(b) If E is a well-ordered set, show that no section of E has the order type of E, 
nor do two different sections of E have the same order type. [Hint: Given J, 
there is at most one order-preserving map of J into E whose image is E or 
a section of E.] 


3. Let J and E be well-ordered sets; suppose there is an order-preserving map
k : J → E. Using Exercises 1 and 2, show that J has the order type of E or
a section of E. [Hint: Choose e_0 ∈ E. Define h : J → E by the recursion
formula


h(α) = smallest [E − h(S_α)] if h(S_α) ≠ E,


and h(α) = e_0 otherwise. Show that h(α) ≤ k(α) for all α; conclude that
h(S_α) ≠ E for all α.]


4. Use Exercises 1–3 to prove the following:

(a) If A and B are well-ordered sets, then exactly one of the following three
conditions holds: A and B have the same order type, or A has the order type
of a section of B, or B has the order type of a section of A. [Hint: Form
a well-ordered set containing both A and B, as in Exercise 8 of §10; then
apply the preceding exercise.]

(b) Suppose that A and B are well-ordered sets that are uncountable, such that
every section of A and of B is countable. Show A and B have the same order
type.


5. Let X be a set; let 𝒜 be the collection of all pairs (A, <), where A is a subset
of X and < is a well-ordering of A. Define


(A, <) ≺ (A′, <′)


if (A, <) equals a section of (A′, <′).

(a) Show that ≺ is a strict partial order on 𝒜.

(b) Let ℬ be a subcollection of 𝒜 that is simply ordered by ≺. Define B′ to be
the union of the sets B, for all (B, <) ∈ ℬ; and define <′ to be the union
of the relations <, for all (B, <) ∈ ℬ. Show that (B′, <′) is a well-ordered
set.


6. Use Exercises 1 and 5 to prove the following:


Theorem. The maximum principle is equivalent to the well-ordering theorem.


7. Use Exercises 1–5 to prove the following:


Theorem. The choice axiom is equivalent to the well-ordering theorem.

Proof. Let X be a set; let c be a fixed choice function for the nonempty subsets
of X. If T is a subset of X and < is a relation on T, we say that (T, <) is a tower
in X if < is a well-ordering of T and if for each x ∈ T,


x = c(X − S_x(T)),

where S_x(T) is the section of T by x.

(a) Let (T_1, <_1) and (T_2, <_2) be two towers in X. Show that either these two
ordered sets are the same, or one equals a section of the other. [Hint:
Switching indices if necessary, we can assume that h : T_1 → T_2 is order preserving
and h(T_1) equals either T_2 or a section of T_2. Use Exercise 2 to show that
h(x) = x for all x.]

(b) If (T, <) is a tower in X and T ≠ X, show there is a tower in X of which
(T, <) is a section.

(c) Let {(T_k, <_k) | k ∈ K} be the collection of all towers in X. Let


T = ⋃_{k∈K} T_k and < = ⋃_{k∈K} (<_k).


Show that (T, <) is a tower in X. Conclude that T = X.


8. Using Exercises 1–4, construct an uncountable well-ordered set, as follows. Let
𝒜 be the collection of all pairs (A, <), where A is a subset of Z+ and < is a
well-ordering of A. (We allow A to be empty.) Define (A, <) ~ (A′, <′) if (A, <)
and (A′, <′) have the same order type. It is trivial to show this is an equivalence
relation. Let [(A, <)] denote the equivalence class of (A, <); let E denote the
collection of these equivalence classes. Define


[(A, <)] ≪ [(A′, <′)]


if (A, <) has the order type of a section of (A′, <′).

(a) Show that the relation ≪ is well defined and is a simple order on E. Note
that the equivalence class [(∅, ∅)] is the smallest element of E.

(b) Show that if α = [(A, <)] is an element of E, then (A, <) has the same
order type as the section S_α(E) of E by α. [Hint: Define a map f : A → E
by setting f(x) = [(S_x(A), restriction of <)] for each x ∈ A.]

(c) Conclude that E is well-ordered by ≪.

(d) Show that E is uncountable. [Hint: If h : E → Z+ is a bijection, then h
gives rise to a well-ordering of Z+.]

This same argument, with Z+ replaced by an arbitrary well-ordered set X,
proves (without use of the choice axiom) the existence of a well-ordered set E
whose cardinality is greater than that of X.

This exercise shows that one can construct an uncountable well-ordered set,
and hence the minimal uncountable well-ordered set, by an explicit construction
that does not use the choice axiom. However, this result is less interesting than it
might appear. The crucial property of S_Ω, the one we use repeatedly, is the fact
that every countable subset of S_Ω has an upper bound in S_Ω. That fact depends,
in turn, on the fact that a countable union of countable sets is countable. And the
proof of that result (if you examine it carefully) involves an infinite number of
arbitrary choices; that is, it depends on the choice axiom.

Said differently, without the choice axiom we may be able to construct the 
minimal uncountable well-ordered set, but we can’t use it for anything! 


Chapter 2


Topological Spaces 
and Continuous Functions 


The concept of topological space grew out of the study of the real line and euclidean 
space and the study of continuous functions on these spaces. In this chapter, we de- 
fine what a topological space is, and we study a number of ways of constructing a 
topology on a set so as to make it into a topological space. We also consider some 
of the elementary concepts associated with topological spaces. Open and closed sets, 
limit points, and continuous functions are introduced as natural generalizations of the
corresponding ideas for the real line and euclidean space. 


§12 Topological Spaces 


The definition of a topological space that is now standard was a long time in being 
formulated. Various mathematicians—Fréchet, Hausdorff, and others—proposed dif- 
ferent definitions over a period of years during the first decades of the twentieth cen- 
tury, but it took quite a while before mathematicians settled on the one that seemed 
most suitable. They wanted, of course, a definition that was as broad as possible, 
so that it would include as special cases all the various examples that were useful 
in mathematics—euclidean space, infinite-dimensional euclidean space, and function 
spaces among them—but they also wanted the definition to be narrow enough that the 
standard theorems about these familiar spaces would hold for topological spaces in
general. This is always the problem when one is trying to formulate a new mathematical
concept: to decide how general its definition should be. The definition finally
settled on may seem a bit abstract, but as you work through the various ways of con- 
structing topological spaces, you will get a better feeling for what the concept means. 


Definition. A topology on a set X is a collection 𝒯 of subsets of X having the
following properties:

(1) ∅ and X are in 𝒯.

(2) The union of the elements of any subcollection of 𝒯 is in 𝒯.

(3) The intersection of the elements of any finite subcollection of 𝒯 is in 𝒯.
A set X for which a topology 𝒯 has been specified is called a topological space.


Properly speaking, a topological space is an ordered pair (X, 𝒯) consisting of a
set X and a topology 𝒯 on X, but we often omit specific mention of 𝒯 if no confusion
will arise.

If X is a topological space with topology 𝒯, we say that a subset U of X is an
open set of X if U belongs to the collection 𝒯. Using this terminology, one can say
that a topological space is a set X together with a collection of subsets of X, called 
open sets, such that Ø and X are both open, and such that arbitrary unions and finite 
intersections of open sets are open. 


EXAMPLE 1. Let X be a three-element set, X = {a, b, c}. There are many possible
topologies on X, some of which are indicated schematically in Figure 12.1. The diagram
in the upper right-hand corner indicates the topology in which the open sets are X, ∅,
{a, b}, {b}, and {b, c}. The topology in the upper left-hand corner contains only X and ∅,
while the topology in the lower right-hand corner contains every subset of X. You can get
other topologies on X by permuting a, b, and c.


Figure 12.1 


From this example, you can see that even a three-element set has many different
topologies. But not every collection of subsets of X is a topology on X. Neither of the
collections indicated in Figure 12.2 is a topology, for instance.


Figure 12.2 
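
On a finite set the three axioms can be checked mechanically. The following is a minimal sketch, not part of the text: the function names are ours, and the second sample collection is a hypothetical non-example chosen for illustration (it is not claimed to be one of the collections drawn in Figure 12.2).

```python
from itertools import combinations


def powerset(items):
    """All subsets of `items`, each as a frozenset."""
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def is_topology(X, collection):
    """Check the three axioms for a collection of subsets of a finite set X."""
    X = frozenset(X)
    T = {frozenset(U) for U in collection}
    if frozenset() not in T or X not in T:
        return False
    # On a finite set, closure under pairwise unions and intersections
    # already gives closure under arbitrary unions and finite intersections.
    return all((U | V) in T and (U & V) in T for U, V in combinations(T, 2))


X = {"a", "b", "c"}
upper_right = [set(), X, {"a", "b"}, {"b"}, {"b", "c"}]   # the topology named in Example 1
not_a_topology = [set(), X, {"a"}, {"b"}]                 # hypothetical: {a, b} is missing

print(is_topology(X, upper_right))      # True
print(is_topology(X, not_a_topology))   # False

# Brute-force count over all 2^8 collections of subsets of X.
print(sum(is_topology(X, c) for c in powerset(powerset(X))))   # 29
```

The count 29 treats topologies that differ by a permutation of a, b, c as distinct, which is why it exceeds the number of diagrams in Figure 12.1.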


EXAMPLE 2. If X is any set, the collection of all subsets of X is a topology on X; it is
called the discrete topology. The collection consisting of X and ∅ only is also a topology
on X; we shall call it the indiscrete topology, or the trivial topology.


EXAMPLE 3. Let X be a set; let 𝒯_f be the collection of all subsets U of X such that X − U
either is finite or is all of X. Then 𝒯_f is a topology on X, called the finite complement
topology. Both X and ∅ are in 𝒯_f, since X − X is finite and X − ∅ is all of X. If {U_α} is
an indexed family of nonempty elements of 𝒯_f, to show that ⋃ U_α is in 𝒯_f, we compute


X − ⋃ U_α = ⋂ (X − U_α).


The latter set is finite because each set X − U_α is finite. If U_1, ..., U_n are nonempty
elements of 𝒯_f, to show that ⋂ U_i is in 𝒯_f, we compute


X − ⋂_{i=1}^{n} U_i = ⋃_{i=1}^{n} (X − U_i).


The latter set is a finite union of finite sets and, therefore, finite.


EXAMPLE 4. Let X be a set; let 𝒯_c be the collection of all subsets U of X such that
X − U either is countable or is all of X. Then 𝒯_c is a topology on X, as you can check.


Definition. Suppose that 𝒯 and 𝒯′ are two topologies on a given set X. If 𝒯′ ⊃ 𝒯,
we say that 𝒯′ is finer than 𝒯; if 𝒯′ properly contains 𝒯, we say that 𝒯′ is strictly
finer than 𝒯. We also say that 𝒯 is coarser than 𝒯′, or strictly coarser, in these two
respective situations. We say 𝒯 is comparable with 𝒯′ if either 𝒯′ ⊃ 𝒯 or 𝒯 ⊃ 𝒯′.


This terminology is suggested by thinking of a topological space as being some- 
thing like a truckload full of gravel—the pebbles and all unions of collections of peb- 
bles being the open sets. If now we smash the pebbles into smaller ones, the collection 
of open sets has been enlarged, and the topology, like the gravel, is said to have been 
made finer by the operation. 

Two topologies on X need not be comparable, of course. In Figure 12.1 preceding,
the topology in the upper right-hand corner is strictly finer than each of the three
topologies in the first column and strictly coarser than each of the other topologies in
the third column. But it is not comparable with any of the topologies in the second
column.
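
For topologies on a finite set, "finer" is plain containment of collections and can be tested directly. The sketch below is illustrative only; the function names and the second topology `T2` are assumptions made for the example, not taken from Figure 12.1.

```python
def is_finer(T_prime, T):
    """True if T_prime contains T, that is, T_prime is finer than T."""
    return {frozenset(U) for U in T} <= {frozenset(U) for U in T_prime}


def is_strictly_finer(T_prime, T):
    return is_finer(T_prime, T) and not is_finer(T, T_prime)


def comparable(T1, T2):
    return is_finer(T1, T2) or is_finer(T2, T1)


X = {"a", "b", "c"}
indiscrete = [set(), X]
T1 = [set(), X, {"a", "b"}, {"b"}, {"b", "c"}]   # the topology named in Example 1 of §12
T2 = [set(), X, {"a"}, {"b", "c"}]               # hypothetical second topology

print(is_strictly_finer(T1, indiscrete))   # True
print(comparable(T1, T2))                  # False: {b} is open only in T1, {a} only in T2
```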

Other terminology is sometimes used for this concept. If 𝒯′ ⊃ 𝒯, some
mathematicians would say that 𝒯′ is larger than 𝒯, and 𝒯 is smaller than 𝒯′. This is
certainly acceptable terminology, if not as vivid as the words “finer” and “coarser.”

Many mathematicians use the words “weaker” and “stronger” in this context.
Unfortunately, some of them (particularly analysts) are apt to say that 𝒯′ is stronger
than 𝒯 if 𝒯′ ⊃ 𝒯, while others (particularly topologists) are apt to say that 𝒯′ is
weaker than 𝒯 in the same situation! If you run across the terms “strong topology”
or “weak topology” in some book, you will have to decide from the context which 
inclusion is meant. We shall not use these terms in this book. 


§13 Basis for a Topology


For each of the examples in the preceding section, we were able to specify the topology 
by describing the entire collection 𝒯 of open sets. Usually this is too difficult. In
most cases, one specifies instead a smaller collection of subsets of X and defines the 
topology in terms of that. 


Definition. If X is a set, a basis for a topology on X is a collection ℬ of subsets of X
(called basis elements) such that

(1) For each x ∈ X, there is at least one basis element B containing x.

(2) If x belongs to the intersection of two basis elements B_1 and B_2, then there is a
basis element B_3 containing x such that B_3 ⊂ B_1 ∩ B_2.

If ℬ satisfies these two conditions, then we define the topology 𝒯 generated by ℬ as
follows: A subset U of X is said to be open in X (that is, to be an element of 𝒯) if for
each x ∈ U, there is a basis element B ∈ ℬ such that x ∈ B and B ⊂ U. Note that
each basis element is itself an element of 𝒯.


We will check shortly that the collection 𝒯 is indeed a topology on X. But first let
us consider some examples. 


EXAMPLE 1. Let ℬ be the collection of all circular regions (interiors of circles) in the
plane. Then ℬ satisfies both conditions for a basis. The second condition is illustrated in
Figure 13.1. In the topology generated by ℬ, a subset U of the plane is open if every x
in U lies in some circular region contained in U.


Figure 13.1 Figure 13.2


EXAMPLE 2. Let ℬ′ be the collection of all rectangular regions (interiors of rectangles)
in the plane, where the rectangles have sides parallel to the coordinate axes. Then ℬ′
satisfies both conditions for a basis. The second condition is illustrated in Figure 13.2; in
this case, the condition is trivial, because the intersection of any two basis elements is itself
a basis element (or empty). As we shall see later, the basis ℬ′ generates the same topology
on the plane as the basis ℬ given in the preceding example.


EXAMPLE 3. If X is any set, the collection of all one-point subsets of X is a basis for
the discrete topology on X.


Let us check now that the collection 𝒯 generated by the basis ℬ is, in fact, a
topology on X. If U is the empty set, it satisfies the defining condition of openness
vacuously. Likewise, X is in 𝒯, since for each x ∈ X there is some basis element
B containing x and contained in X. Now let us take an indexed family {U_α}_{α∈J} of
elements of 𝒯 and show that


U = ⋃_{α∈J} U_α


belongs to 𝒯. Given x ∈ U, there is an index α such that x ∈ U_α. Since U_α is open,
there is a basis element B such that x ∈ B ⊂ U_α. Then x ∈ B and B ⊂ U, so that U
is open, by definition.

Now let us take two elements U_1 and U_2 of 𝒯 and show that U_1 ∩ U_2 belongs to 𝒯.
Given x ∈ U_1 ∩ U_2, choose a basis element B_1 containing x such that B_1 ⊂ U_1; choose
also a basis element B_2 containing x such that B_2 ⊂ U_2. The second condition for a
basis enables us to choose a basis element B_3 containing x such that B_3 ⊂ B_1 ∩ B_2.
See Figure 13.3. Then x ∈ B_3 and B_3 ⊂ U_1 ∩ U_2, so U_1 ∩ U_2 belongs to 𝒯, by
definition.


Figure 13.3 


Finally, we show by induction that any finite intersection U_1 ∩ ··· ∩ U_n of elements
of 𝒯 is in 𝒯. This fact is trivial for n = 1; we suppose it true for n − 1 and prove it
for n. Now


(U_1 ∩ ··· ∩ U_n) = (U_1 ∩ ··· ∩ U_{n−1}) ∩ U_n.

By hypothesis, U_1 ∩ ··· ∩ U_{n−1} belongs to 𝒯; by the result just proved, the
intersection of U_1 ∩ ··· ∩ U_{n−1} and U_n also belongs to 𝒯.

Thus we have checked that the collection of open sets generated by a basis ℬ is, in
fact, a topology.
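
For a finite set X, both the two basis conditions and the topology a basis generates can be computed by brute force. The sketch below is illustrative only; the function names and the sample basis are assumptions of ours, not taken from the text.

```python
from itertools import combinations


def is_basis(X, B):
    """Check conditions (1) and (2) for a collection B of subsets of a finite set X."""
    B = [frozenset(b) for b in B]
    covers = all(any(x in b for b in B) for x in X)
    refines = all(any(x in b3 and b3 <= (b1 & b2) for b3 in B)
                  for b1 in B for b2 in B for x in (b1 & b2))
    return covers and refines


def generated_topology(X, B):
    """All U such that each x in U lies in some basis element contained in U."""
    B = [frozenset(b) for b in B]
    points = list(X)
    subsets = (frozenset(c) for r in range(len(points) + 1)
               for c in combinations(points, r))
    return {U for U in subsets
            if all(any(x in b and b <= U for b in B) for x in U)}


def is_topology(X, T):
    X, T = frozenset(X), set(T)
    return (frozenset() in T and X in T and
            all((U | V) in T and (U & V) in T for U, V in combinations(T, 2)))


X = {1, 2, 3, 4}
B = [{1}, {2}, {1, 2, 3}, {1, 2, 4}]      # hypothetical basis
T = generated_topology(X, B)

print(is_basis(X, B))        # True
print(len(T))                # 7
print(is_topology(X, T))     # True
print(is_basis({1, 2, 3}, [{1, 2}, {2, 3}]))   # False: nothing fits inside {1,2} ∩ {2,3} = {2}
```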

Another way of describing the topology generated by a basis is given in the fol- 
lowing lemma: 


Lemma 13.1. Let X be a set; let ℬ be a basis for a topology 𝒯 on X. Then 𝒯 equals
the collection of all unions of elements of ℬ.


Proof. Given a collection of elements of ℬ, they are also elements of 𝒯. Because 𝒯
is a topology, their union is in 𝒯. Conversely, given U ∈ 𝒯, choose for each x ∈ U
an element B_x of ℬ such that x ∈ B_x ⊂ U. Then U = ⋃_{x∈U} B_x, so U equals a union
of elements of ℬ. ∎


This lemma states that every open set U in X can be expressed as a union of 
basis elements. This expression for U is not, however, unique. Thus the use of the 
term “basis” in topology differs drastically from its use in linear algebra, where the 
equation expressing a given vector as a linear combination of basis vectors is unique. 
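
The non-uniqueness is easy to see on a small example. The sketch below, under assumptions of our own (the function name and the sample basis are hypothetical), forms every union of basis elements and records which subcollections produce each open set.

```python
from itertools import combinations


def unions_with_witnesses(B):
    """Map each union of basis elements to the subcollections that produce it."""
    B = [frozenset(b) for b in B]
    table = {}
    for r in range(len(B) + 1):
        for sub in combinations(B, r):
            U = frozenset().union(*sub)   # the empty subcollection has union ∅
            table.setdefault(U, []).append(sub)
    return table


B = [{1}, {2}, {1, 2, 3}, {1, 2, 4}]      # hypothetical basis on X = {1, 2, 3, 4}
table = unions_with_witnesses(B)

print(len(table))                          # 7 open sets in all
print(len(table[frozenset({1, 2, 3})]))    # 4 different ways to write {1, 2, 3}
```

Here {1, 2, 3} arises from {1, 2, 3} alone and also from {1, 2, 3} together with {1}, with {2}, or with both.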

We have described in two different ways how to go from a basis to the topology 
it generates. Sometimes we need to go in the reverse direction, from a topology to a 
basis generating it. Here is one way of obtaining a basis for a given topology; we shall 
use it frequently. 


Lemma 13.2. Let X be a topological space. Suppose that 𝒞 is a collection of open
sets of X such that for each open set U of X and each x in U, there is an element C
of 𝒞 such that x ∈ C ⊂ U. Then 𝒞 is a basis for the topology of X.


Proof. We must show that 𝒞 is a basis. The first condition for a basis is easy: Given
x ∈ X, since X is itself an open set, there is by hypothesis an element C of 𝒞 such
that x ∈ C ⊂ X. To check the second condition, let x belong to C_1 ∩ C_2, where C_1
and C_2 are elements of 𝒞. Since C_1 and C_2 are open, so is C_1 ∩ C_2. Therefore, there
exists by hypothesis an element C_3 in 𝒞 such that x ∈ C_3 ⊂ C_1 ∩ C_2.

Let 𝒯 be the collection of open sets of X; we must show that the topology 𝒯′
generated by 𝒞 equals the topology 𝒯. First, note that if U belongs to 𝒯 and if x ∈ U,
then there is by hypothesis an element C of 𝒞 such that x ∈ C ⊂ U. It follows that U
belongs to the topology 𝒯′, by definition. Conversely, if W belongs to the topology 𝒯′,
then W equals a union of elements of 𝒞, by the preceding lemma. Since each element
of 𝒞 belongs to 𝒯 and 𝒯 is a topology, W also belongs to 𝒯. ∎


When topologies are given by bases, it is useful to have a criterion in terms of the 
bases for determining whether one topology is finer than another. One such criterion 
is the following.


Lemma 13.3. Let ℬ and ℬ′ be bases for the topologies 𝒯 and 𝒯′, respectively, on
X. Then the following are equivalent:
(1) 𝒯′ is finer than 𝒯.
(2) For each x ∈ X and each basis element B ∈ ℬ containing x, there is a basis
element B′ ∈ ℬ′ such that x ∈ B′ ⊂ B.


Proof. (2) ⟹ (1). Given an element U of 𝒯, we wish to show that U ∈ 𝒯′. Let
x ∈ U. Since ℬ generates 𝒯, there is an element B ∈ ℬ such that x ∈ B ⊂ U.
Condition (2) tells us there exists an element B′ ∈ ℬ′ such that x ∈ B′ ⊂ B. Then
x ∈ B′ ⊂ U, so U ∈ 𝒯′, by definition.

(1) ⟹ (2). We are given x ∈ X and B ∈ ℬ, with x ∈ B. Now B belongs to 𝒯
by definition and 𝒯 ⊂ 𝒯′ by condition (1); therefore, B ∈ 𝒯′. Since 𝒯′ is generated
by ℬ′, there is an element B′ ∈ ℬ′ such that x ∈ B′ ⊂ B. ∎


Some students find this condition hard to remember. “Which way does the inclu- 
sion go?” they ask. It may be easier to remember if you recall the analogy between 
a topological space and a truckload full of gravel. Think of the pebbles as the basis 
elements of the topology; after the pebbles are smashed to dust, the dust particles are 
the basis elements of the new topology. The new topology is finer than the old one, 
and each dust particle was contained inside a pebble, as the criterion states. 
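
The criterion of Lemma 13.3 can be tested on finite examples and compared with direct containment of the generated topologies. The sketch below is an illustration only; the names `pebbles` and `dust` echo the gravel analogy, and the sample bases are assumptions of ours.

```python
from itertools import combinations


def generated_topology(X, B):
    """All U such that each x in U lies in some basis element contained in U."""
    B = [frozenset(b) for b in B]
    points = list(X)
    subsets = (frozenset(c) for r in range(len(points) + 1)
               for c in combinations(points, r))
    return {U for U in subsets
            if all(any(x in b and b <= U for b in B) for x in U)}


def satisfies_criterion(B, B_prime):
    """Condition (2): each x in each B has some B' with x in B' and B' inside B."""
    B = [frozenset(b) for b in B]
    B_prime = [frozenset(b) for b in B_prime]
    return all(any(x in bp and bp <= b for bp in B_prime)
               for b in B for x in b)


X = {1, 2, 3, 4}
pebbles = [{1, 2}, {3, 4}]          # hypothetical coarse basis
dust = [{1}, {2}, {3}, {4}]         # hypothetical fine basis

T, T_prime = generated_topology(X, pebbles), generated_topology(X, dust)

print(satisfies_criterion(pebbles, dust), T_prime >= T)   # True True
print(satisfies_criterion(dust, pebbles), T >= T_prime)   # False False
```

In both directions the criterion and the containment agree, as the lemma asserts.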

EXAMPLE 4. One can now see that the collection ℬ of all circular regions in the plane
generates the same topology as the collection ℬ′ of all rectangular regions; Figure 13.4
illustrates the proof. We shall treat this example more formally when we study metric
spaces.


Figure 13.4 


We now define three topologies on the real line R, all of which are of interest. 


Definition. If ℬ is the collection of all open intervals in the real line,

(a, b) = {x | a < x < b},


the topology generated by ℬ is called the standard topology on the real line. Whenever
we consider R, we shall suppose it is given this topology unless we specifically state
otherwise. If ℬ′ is the collection of all half-open intervals of the form


[a, b) = {x | a ≤ x < b},

where a < b, the topology generated by ℬ′ is called the lower limit topology on R.
When R is given the lower limit topology, we denote it by R_ℓ. Finally let K denote the
set of all numbers of the form 1/n, for n ∈ Z+, and let ℬ″ be the collection of all open
intervals (a, b), along with all sets of the form (a, b) − K. The topology generated
by ℬ″ will be called the K-topology on R. When R is given this topology, we denote
it by R_K.


It is easy to see that all three of these collections are bases; in each case, the 
intersection of two basis elements is either another basis element or is empty. The 
relation between these topologies is the following: 


Lemma 13.4. The topologies of R_ℓ and R_K are strictly finer than the standard
topology on R, but are not comparable with one another.


Proof. Let 𝒯, 𝒯′, and 𝒯″ be the topologies of R, R_ℓ, and R_K, respectively. Given
a basis element (a, b) for 𝒯 and a point x of (a, b), the basis element [x, b) for 𝒯′
contains x and lies in (a, b). On the other hand, given the basis element [x, d) for 𝒯′,
there is no open interval (a, b) that contains x and lies in [x, d). Thus 𝒯′ is strictly
finer than 𝒯.
A similar argument applies to R_K. Given a basis element (a, b) for 𝒯 and a
point x of (a, b), this same interval is a basis element for 𝒯″ that contains x. On the
other hand, given the basis element B = (−1, 1) − K for 𝒯″ and the point 0 of B,
there is no open interval that contains 0 and lies in B.
We leave it to you to show that the topologies of R_ℓ and R_K are not comparable. ∎
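
The two "there is no open interval" claims in this proof come down to exhibiting a point of the interval that escapes the given basis element. The sketch below makes those witnesses explicit with exact rational arithmetic; the function names are ours, and the code illustrates the argument rather than proving it.

```python
from fractions import Fraction
import math


def point_of_K_in(a, b):
    """For a < 0 < b, return some 1/n in K with a < 1/n < b.

    Such a point lies in (a, b) but not in (-1, 1) - K.
    """
    a, b = Fraction(a), Fraction(b)
    if not a < 0 < b:
        raise ValueError("the interval (a, b) must contain 0")
    n = math.floor(1 / b) + 1      # the smallest positive integer with 1/n < b
    return Fraction(1, n)


def point_left_of(a, x):
    """For a < x, return a point of (a, x): it lies in (a, b) but not in [x, d)."""
    a, x = Fraction(a), Fraction(x)
    if not a < x:
        raise ValueError("need a < x")
    return (a + x) / 2


print(point_of_K_in(Fraction(-1, 2), Fraction(1, 10)))   # 1/11
print(point_left_of(0, 1))                               # 1/2
```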


A question may occur to you at this point. Since the topology generated by a 
basis ℬ may be described as the collection of arbitrary unions of elements of ℬ, what
happens if you start with a given collection of sets and take finite intersections of
them as well as arbitrary unions? This question leads to the notion of a subbasis for a
topology.


Definition. A subbasis 𝒮 for a topology on X is a collection of subsets of X whose
union equals X. The topology generated by the subbasis 𝒮 is defined to be the
collection 𝒯 of all unions of finite intersections of elements of 𝒮.
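
On a finite set this definition can be carried out literally: first form all finite intersections of subbasis elements, then all unions of those. The sketch below is illustrative only; the function names and the sample subbasis are assumptions of ours.

```python
from itertools import combinations


def basis_from_subbasis(S):
    """All intersections of nonempty finite subcollections of S."""
    S = [frozenset(s) for s in S]
    return {frozenset.intersection(*sub)
            for r in range(1, len(S) + 1) for sub in combinations(S, r)}


def all_unions(B):
    """All unions of subcollections of B (the empty subcollection gives ∅)."""
    B = list(B)
    return {frozenset().union(*sub)
            for r in range(len(B) + 1) for sub in combinations(B, r)}


X = {1, 2, 3}
S = [{1, 2}, {2, 3}]                 # hypothetical subbasis: its union is X
B = basis_from_subbasis(S)
T = all_unions(B)

print(sorted(sorted(U) for U in B))   # [[1, 2], [2], [2, 3]]
print(sorted(sorted(U) for U in T))   # [[], [1, 2], [1, 2, 3], [2], [2, 3]]
```

Note that S itself is not a basis here, since no element of S lies inside {1, 2} ∩ {2, 3} = {2}; taking finite intersections is what repairs this.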


We must of course check that 𝒯 is a topology. For this purpose it will suffice to
show that the collection ℬ of all finite intersections of elements of 𝒮 is a basis, for
then the collection 𝒯 of all unions of elements of ℬ is a topology, by Lemma 13.1.
Given x ∈ X, it belongs to an element of 𝒮 and hence to an element of ℬ; this is the
first condition for a basis. To check the second condition, let

B_1 = S_1 ∩ ··· ∩ S_m and B_2 = S′_1 ∩ ··· ∩ S′_n


be two elements of ℬ. Their intersection


B_1 ∩ B_2 = (S_1 ∩ ··· ∩ S_m) ∩ (S′_1 ∩ ··· ∩ S′_n)

is also a finite intersection of elements of 𝒮, so it belongs to ℬ.


Exercises 


1. Let X be a topological space; let A be a subset of X. Suppose that for each x ∈ A
there is an open set U containing x such that U ⊂ A. Show that A is open in X.


2. Consider the nine topologies on the set X = {a, b, c} indicated in Example 1
of §12. Compare them; that is, for each pair of topologies, determine whether
they are comparable, and if so, which is the finer.


3. Show that the collection 𝒯_c given in Example 4 of §12 is a topology on the set X.
Is the collection

𝒯_∞ = {U | X − U is infinite or empty or all of X}

a topology on X?


4. (a) If {𝒯_α} is a family of topologies on X, show that ⋂ 𝒯_α is a topology on X.
Is ⋃ 𝒯_α a topology on X?

(b) Let {𝒯_α} be a family of topologies on X. Show that there is a unique smallest
topology on X containing all the collections 𝒯_α, and a unique largest
topology contained in all 𝒯_α.

(c) If X = {a, b, c}, let


𝒯_1 = {∅, X, {a}, {a, b}} and 𝒯_2 = {∅, X, {a}, {b, c}}.


Find the smallest topology containing 𝒯_1 and 𝒯_2, and the largest topology
contained in 𝒯_1 and 𝒯_2.


5. Show that if 𝒜 is a basis for a topology on X, then the topology generated by 𝒜
equals the intersection of all topologies on X that contain 𝒜. Prove the same if
𝒜 is a subbasis.


6. Show that the topologies of R_ℓ and R_K are not comparable.


7. Consider the following topologies on R:

𝒯_1 = the standard topology,

𝒯_2 = the topology of R_K,

𝒯_3 = the finite complement topology,

𝒯_4 = the upper limit topology, having all sets (a, b] as basis,

𝒯_5 = the topology having all sets (−∞, a) = {x | x < a} as basis.

Determine, for each of these topologies, which of the others it contains.


8. (a) Apply Lemma 13.2 to show that the countable collection


ℬ = {(a, b) | a < b, a and b rational}

is a basis that generates the standard topology on R.

(b) Show that the collection


𝒞 = {[a, b) | a < b, a and b rational}


is a basis that generates a topology different from the lower limit topology
on R.


§14 The Order Topology 


If X is a simply ordered set, there is a standard topology for X, defined using the order 
relation. It is called the order topology; in this section, we consider it and study some 
of its properties. 

Suppose that X is a set having a simple order relation <. Given elements a and b 
of X such that a < b, there are four subsets of X that are called the intervals
determined by a and b. They are the following:


(a, b) = {x | a < x < b},
(a, b] = {x | a < x ≤ b},
[a, b) = {x | a ≤ x < b},
[a, b] = {x | a ≤ x ≤ b}.


The notation used here is familiar to you already in the case where X is the real line, 
but these are intervals in an arbitrary ordered set. A set of the first type is called an 
open interval in X, a set of the last type is called a closed interval in X, and sets of the 
second and third types are called half-open intervals. The use of the term “open” in 
this connection suggests that open intervals in X should turn out to be open sets when 
we put a topology on X. And so they will. 


Definition. Let X be a set with a simple order relation; assume X has more than one
element. Let ℬ be the collection of all sets of the following types:

(1) All open intervals (a, b) in X.

(2) All intervals of the form [a_0, b), where a_0 is the smallest element (if any) of X.

(3) All intervals of the form (a, b_0], where b_0 is the largest element (if any) of X.
The collection ℬ is a basis for a topology on X, which is called the order topology.


If X has no smallest element, there are no sets of type (2), and if X has no largest 
element, there are no sets of type (3). 

One has to check that ℬ satisfies the requirements for a basis. First, note that every
element x of X lies in at least one element of ℬ: The smallest element (if any) lies
in all sets of type (2), the largest element (if any) lies in all sets of type (3), and every
other element lies in a set of type (1). Second, note that the intersection of any two sets
of the preceding types is again a set of one of these types, or is empty. Several cases
need to be checked; we leave it to you.


EXAMPLE 1. The standard topology on R, as defined in the preceding section, is just the
order topology derived from the usual order on R.


EXAMPLE 2. Consider the set R × R in the dictionary order; we shall denote the general
element of R × R by x × y, to avoid difficulty with notation. The set R × R has neither a
largest nor a smallest element, so the order topology on R × R has as basis the collection
of all open intervals of the form (a × b, c × d) for a < c, and for a = c and b < d. These
two types of intervals are indicated in Figure 14.1. The subcollection consisting of only
intervals of the second type is also a basis for the order topology on R × R, as you can
check.


Figure 14.1


EXAMPLE 3. The positive integers Z₊ form an ordered set with a smallest element. The
order topology on Z₊ is the discrete topology, for every one-point set is open: If n > 1,
then the one-point set {n} = (n − 1, n + 1) is a basis element; and if n = 1, the one-point
set {1} = [1, 2) is a basis element.


EXAMPLE 4. The set X = {1, 2} × Z₊ in the dictionary order is another example of
an ordered set with a smallest element. Denoting 1 × n by aₙ and 2 × n by bₙ, we can
represent X by


a₁, a₂, ...; b₁, b₂, ....


The order topology on X is not the discrete topology. Most one-point sets are open, but
there is an exception: the one-point set {b₁}. Any open set containing b₁ must contain a
basis element about b₁ (by definition), and any basis element containing b₁ contains points
of the aᵢ sequence.


Definition. If X is an ordered set, and a is an element of X, there are four subsets
of X that are called the rays determined by a. They are the following:

(a, +∞) = {x | x > a},

(−∞, a) = {x | x < a},

[a, +∞) = {x | x ≥ a},

(−∞, a] = {x | x ≤ a}.

Sets of the first two types are called open rays, and sets of the last two types are called
closed rays.


The use of the term “open” suggests that open rays in X are open sets in the order
topology. And so they are. Consider, for example, the ray (a, +∞). If X has a largest
element b₀, then (a, +∞) equals the basis element (a, b₀]. If X has no largest element,
then (a, +∞) equals the union of all basis elements of the form (a, x), for x > a. In
either case, (a, +∞) is open. A similar argument applies to the ray (−∞, a).

The open rays, in fact, form a subbasis for the order topology on X, as we now
show. Because the open rays are open in the order topology, the topology they
generate is contained in the order topology. On the other hand, every basis element for
the order topology equals a finite intersection of open rays; the interval (a, b) equals
the intersection of (−∞, b) and (a, +∞), while [a₀, b) and (a, b₀], if they exist, are
themselves open rays. Hence the topology generated by the open rays contains the
order topology.
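
For a finite ordered set these claims can be checked by brute force. The following Python sketch is only an illustration under stated assumptions: it takes X to be the chain {0, 1, ..., n − 1} with its usual order, and the helper names order_basis and open_rays are ours, not part of the text. It builds the basis of the definition, confirms that every basis element is an intersection of at most two open rays, and confirms that every one-point set is a basis element, so that the order topology on a finite chain is discrete (compare Example 3).

```python
from itertools import product

def order_basis(n):
    """Basis of the order topology on the chain X = {0, ..., n-1}, n >= 2."""
    X = range(n)
    basis = set()
    for a in X:
        for b in X:
            if a < b:
                basis.add(frozenset(x for x in X if a < x < b))  # type (1): (a, b)
    for b in X:
        if 0 < b:
            basis.add(frozenset(x for x in X if x < b))          # type (2): [a0, b), a0 = 0
    for a in X:
        if a < n - 1:
            basis.add(frozenset(x for x in X if a < x))          # type (3): (a, b0], b0 = n-1
    basis.discard(frozenset())   # an interval such as (a, a+1) is empty; drop it
    return basis

def open_rays(n):
    """All open rays (a, +inf) and (-inf, a) of the chain {0, ..., n-1}."""
    X = range(n)
    rays = set()
    for a in X:
        rays.add(frozenset(x for x in X if x > a))
        rays.add(frozenset(x for x in X if x < a))
    return rays

basis = order_basis(5)
rays = open_rays(5)

# Every basis element is an intersection of two (possibly equal) open rays.
all_from_rays = all(
    any(r & s == B for r, s in product(rays, repeat=2)) for B in basis
)
# Every one-point set is a basis element, so the topology is discrete.
singletons_open = all(frozenset({k}) in basis for k in range(5))

print(all_from_rays, singletons_open)
```

Both checks succeed for n = 5. A finite chain cannot exhibit the behavior of Example 4, which needs a point with infinitely many predecessors and no immediate predecessor.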


§15 The Product Topology on X x Y 


If X and Y are topological spaces, there is a standard way of defining a topology on 
the cartesian product X x Y. We consider this topology now and study some of its 
properties. 


Definition. Let X and Y be topological spaces. The product topology on X × Y is
the topology having as basis the collection ℬ of all sets of the form U × V, where U
is an open subset of X and V is an open subset of Y.


Let us check that ℬ is a basis. The first condition is trivial, since X × Y is itself
a basis element. The second condition is almost as easy, since the intersection of any
two basis elements U₁ × V₁ and U₂ × V₂ is another basis element. For


(U₁ × V₁) ∩ (U₂ × V₂) = (U₁ ∩ U₂) × (V₁ ∩ V₂),


and the latter set is a basis element because U₁ ∩ U₂ and V₁ ∩ V₂ are open in X and Y,
respectively. See Figure 15.1.

Note that the collection ℬ is not a topology on X × Y. The union of the two
rectangles pictured in Figure 15.1, for instance, is not a product of two sets, so it
cannot belong to ℬ; however, it is open in X × Y.
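
The same phenomenon already shows up in a very small example. The Python sketch below is an illustration only: the two-point discrete spaces and the name product_basis are our own assumptions, chosen to keep the computation finite. It forms all products U × V of open sets, checks that this collection is closed under pairwise intersection, and exhibits a union of two "rectangles" that is not itself a product.

```python
from itertools import product

def product_basis(TX, TY):
    """All sets U x V with U open in X and V open in Y."""
    return {frozenset(product(U, V)) for U in TX for V in TY}

# Assumed example: X = Y = {0, 1} with the discrete topology.
TX = [frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})]
TY = TX
basis = product_basis(TX, TY)

# Intersections of basis elements are again basis elements.
closed_under_intersection = all(B1 & B2 in basis for B1 in basis for B2 in basis)

r1 = frozenset(product({0}, {0}))   # the rectangle {0} x {0}
r2 = frozenset(product({1}, {1}))   # the rectangle {1} x {1}
union = r1 | r2                     # open in X x Y, but not of the form U x V

print(closed_under_intersection, r1 in basis, r2 in basis, union in basis)
```

The union fails to be a product because any product containing both 0 × 0 and 1 × 1 must also contain 0 × 1 and 1 × 0.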

Each time we introduce a new concept, we shall try to relate it to the concepts that 
have been previously introduced. In the present case, we ask: What can one say if the 
topologies on X and Y are given by bases? The answer is as follows: 


Theorem 15.1. If ℬ is a basis for the topology of X and 𝒞 is a basis for the topology
of Y, then the collection


𝒟 = {B × C | B ∈ ℬ and C ∈ 𝒞}

is a basis for the topology of X × Y.

Figure 15.1


Proof. We apply Lemma 13.2. Given an open set W of X × Y and a point x × y
of W, by definition of the product topology there is a basis element U × V such that
x × y ∈ U × V ⊂ W. Because ℬ and 𝒞 are bases for X and Y, respectively, we can
choose an element B of ℬ such that x ∈ B ⊂ U, and an element C of 𝒞 such that
y ∈ C ⊂ V. Then x × y ∈ B × C ⊂ W. Thus the collection 𝒟 meets the criterion of
Lemma 13.2, so 𝒟 is a basis for X × Y. ∎


EXAMPLE 1. We have a standard topology on R: the order topology. The product of
this topology with itself is called the standard topology on R × R = R². It has as basis
the collection of all products of open sets of R, but the theorem just proved tells us that the
much smaller collection of all products (a, b) × (c, d) of open intervals in R will also serve
as a basis for the topology of R². Each such set can be pictured as the interior of a rectangle
in R². Thus the standard topology on R² is just the one we considered in Example 2 of §13.


It is sometimes useful to express the product topology in terms of a subbasis. To 
do this, we first define certain functions called projections. 


Definition. Let π₁ : X × Y → X be defined by the equation

π₁(x, y) = x;

let π₂ : X × Y → Y be defined by the equation

π₂(x, y) = y.

The maps π₁ and π₂ are called the projections of X × Y onto its first and second
factors, respectively.


We use the word “onto” because π₁ and π₂ are surjective (unless one of the
spaces X or Y happens to be empty, in which case X × Y is empty and our whole
discussion is empty as well!).

If U is an open subset of X, then the set π₁⁻¹(U) is precisely the set U × Y, which
is open in X × Y. Similarly, if V is open in Y, then


π₂⁻¹(V) = X × V,

which is also open in X × Y. The intersection of these two sets is the set U × V, as
indicated in Figure 15.2. This fact leads to the following theorem:


Theorem 15.2. The collection

𝒮 = {π₁⁻¹(U) | U open in X} ∪ {π₂⁻¹(V) | V open in Y}


is a subbasis for the product topology on X × Y.


Figure 15.2 


Proof. Let 𝒯 denote the product topology on X × Y; let 𝒯′ be the topology
generated by 𝒮. Because every element of 𝒮 belongs to 𝒯, so do arbitrary unions of finite
intersections of elements of 𝒮. Thus 𝒯′ ⊂ 𝒯. On the other hand, every basis element
U × V for the topology 𝒯 is a finite intersection of elements of 𝒮, since


U × V = π₁⁻¹(U) ∩ π₂⁻¹(V).


Therefore, U × V belongs to 𝒯′, so that 𝒯 ⊂ 𝒯′ as well. ∎
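
For finite spaces the theorem can be verified mechanically. The sketch below is a hedged illustration: the two small spaces and the helper names unions_of and finite_intersections are assumptions of ours. It generates one topology from the basis of products U × V and another from the subbasis of preimages under the projections, and compares them.

```python
from itertools import product

def unions_of(family):
    """All unions of subfamilies of a finite family (includes the empty union)."""
    opens = {frozenset()}
    for B in family:
        opens |= {O | B for O in opens}
    return opens

def finite_intersections(subbasis, whole):
    """All intersections of finite subfamilies; the empty intersection is `whole`."""
    inters = {frozenset(whole)}
    for S in subbasis:
        inters |= {I & S for I in inters}
    return inters

# Assumed example spaces (each has one open point and one non-open point).
X, Y = {0, 1}, {"a", "b"}
TX = [frozenset(), frozenset({0}), frozenset(X)]
TY = [frozenset(), frozenset({"a"}), frozenset(Y)]
XY = set(product(X, Y))

# Product topology: unions of basis elements U x V.
T = unions_of({frozenset(product(U, V)) for U in TX for V in TY})

# Subbasis of Theorem 15.2: sets U x Y and X x V.
S = {frozenset(product(U, Y)) for U in TX} | {frozenset(product(X, V)) for V in TY}
T_prime = unions_of(finite_intersections(S, XY))

print(T == T_prime)
```

The two collections agree, as the theorem predicts.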


§16 The Subspace Topology 


Definition. Let X be a topological space with topology 𝒯. If Y is a subset of X, the
collection


𝒯_Y = {Y ∩ U | U ∈ 𝒯}


is a topology on Y, called the subspace topology. With this topology, Y is called a
subspace of X; its open sets consist of all intersections of open sets of X with Y.

It is easy to see that 𝒯_Y is a topology. It contains ∅ and Y because

∅ = Y ∩ ∅   and   Y = Y ∩ X,


where ∅ and X are elements of 𝒯. The fact that it is closed under finite intersections
and arbitrary unions follows from the equations


(U₁ ∩ Y) ∩ ⋯ ∩ (Uₙ ∩ Y) = (U₁ ∩ ⋯ ∩ Uₙ) ∩ Y,

⋃_{α∈J} (U_α ∩ Y) = (⋃_{α∈J} U_α) ∩ Y.


Lemma 16.1. If ℬ is a basis for the topology of X then the collection

ℬ_Y = {B ∩ Y | B ∈ ℬ}


is a basis for the subspace topology on Y.


Proof. Given U open in X and given y ∈ U ∩ Y, we can choose an element B of ℬ
such that y ∈ B ⊂ U. Then y ∈ B ∩ Y ⊂ U ∩ Y. It follows from Lemma 13.2 that ℬ_Y
is a basis for the subspace topology on Y. ∎


When dealing with a space X and a subspace Y, one needs to be careful when
one uses the term “open set”. Does one mean an element of the topology of Y or an
element of the topology of X? We make the following definition: If Y is a subspace
of X, we say that a set U is open in Y (or open relative to Y) if it belongs to the
topology of Y; this implies in particular that it is a subset of Y. We say that U is open
in X if it belongs to the topology of X.

There is a special situation in which every set open in Y is also open in X. 


Lemma 16.2. Let Y be a subspace of X. If U is open in Y and Y is open in X, then
U is open in X.

Proof. Since U is open in Y, U = Y ∩ V for some set V open in X. Since Y and V
are both open in X, so is Y ∩ V. ∎
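
A small finite computation makes the definition and Lemma 16.2 concrete. The Python sketch below is illustrative only; the four-point space, the subsets Y and Z, and the function names are assumptions of ours. It forms the subspace topology on an open subset Y and on a non-open subset Z, checks that each collection is a topology, and tests whether every set open in the subspace is open in X.

```python
def subspace_topology(T, Y):
    """The collection {Y ∩ U | U in T} of the definition."""
    return {U & Y for U in T}

def is_topology(T, X):
    """Topology axioms; for a finite collection, pairwise unions and intersections suffice."""
    return (frozenset() in T and frozenset(X) in T
            and all(U | V in T and U & V in T for U in T for V in T))

# Assumed example: a nested topology on X = {1, 2, 3, 4}.
X = frozenset({1, 2, 3, 4})
T = {frozenset(), frozenset({1}), frozenset({1, 2}), frozenset({1, 2, 3}), X}

Y = frozenset({1, 2})   # open in X
Z = frozenset({2, 3})   # not open in X
TY = subspace_topology(T, Y)
TZ = subspace_topology(T, Z)

print(is_topology(TY, Y), is_topology(TZ, Z))  # both collections are topologies
print(TY <= T)   # Lemma 16.2 applies: every set open in Y is open in X
print(TZ <= T)   # not so for Z: {2} is open in Z but not in X
```

The last comparison fails precisely because Z is not open in X, which shows that the hypothesis of Lemma 16.2 cannot simply be dropped.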


Now let us explore the relation between the subspace topology and the order and
product topologies. For product topologies, the result is what one might expect; for
order topologies, it is not.


Theorem 16.3. If A is a subspace of X and B is a subspace of Y, then the product 
topology on A x B is the same as the topology A x B inherits as a subspace of X x Y. 


Proof. The set U × V is the general basis element for X × Y, where U is open in X
and V is open in Y. Therefore, (U × V) ∩ (A × B) is the general basis element for the
subspace topology on A × B. Now


(U × V) ∩ (A × B) = (U ∩ A) × (V ∩ B).

Since U ∩ A and V ∩ B are the general open sets for the subspace topologies on A
and B, respectively, the set (U ∩ A) × (V ∩ B) is the general basis element for the
product topology on A × B.

The conclusion we draw is that the bases for the subspace topology on A × B and
for the product topology on A × B are the same. Hence the topologies are the same. ∎


Now let X be an ordered set in the order topology, and let Y be a subset of X. The 
order relation on X, when restricted to Y, makes Y into an ordered set. However, the 
resulting order topology on Y need not be the same as the topology that Y inherits as 
a subspace of X. We give one example where the subspace and order topologies on Y 
agree, and two examples where they do not. 

EXAMPLE 1. Consider the subset Y = [0, 1] of the real line R, in the subspace topology.
The subspace topology has as basis all sets of the form (a, b) ∩ Y, where (a, b) is an open
interval in R. Such a set is of one of the following types:


(a, b) ∩ Y =
    (a, b)    if a and b are in Y,
    [0, b)    if only b is in Y,
    (a, 1]    if only a is in Y,
    Y or ∅    if neither a nor b is in Y.


By definition, each of these sets is open in Y. But sets of the second and third types are not
open in the larger space R.

Note that these sets form a basis for the order topology on Y. Thus, we see that in the
case of the set Y = [0, 1], its subspace topology (as a subspace of R) and its order topology
are the same.


EXAMPLE 2. Let Y be the subset [0, 1) ∪ {2} of R. In the subspace topology on Y the
one-point set {2} is open, because it is the intersection of the open set (3/2, 5/2) with Y. But in
the order topology on Y, the set {2} is not open. Any basis element for the order topology
on Y that contains 2 is of the form

{x | x ∈ Y and a < x ≤ 2}


for some a ∈ Y; such a set necessarily contains points of Y less than 2.


EXAMPLE 3. Let I = [0, 1]. The dictionary order on I × I is just the restriction to
I × I of the dictionary order on the plane R × R. However, the dictionary order topology
on I × I is not the same as the subspace topology on I × I obtained from the dictionary
order topology on R × R! For example, the set {1/2} × (1/2, 1] is open in I × I in the
subspace topology, but not in the order topology, as you can check. See Figure 16.1.

The set I × I in the dictionary order topology will be called the ordered square, and
denoted by I_o².


The anomaly illustrated in Examples 2 and 3 does not occur for intervals or rays 
in an ordered set X. This we now prove. 

Given an ordered set X, let us say that a subset Y of X is convex in X if for each 
pair of points a < b of Y, the entire interval (a, b) of points of X lies in Y. Note that 
intervals and rays in X are convex in X. 
Figure 16.1 (left panel: subspace topology; right panel: order topology)


Theorem 16.4. Let X be an ordered set in the order topology; let Y be a subset
of X that is convex in X. Then the order topology on Y is the same as the topology Y
inherits as a subspace of X.


Proof. Consider the ray (a, +∞) in X. What is its intersection with Y? If a ∈ Y,
then


(a, +∞) ∩ Y = {x | x ∈ Y and x > a};


this is an open ray of the ordered set Y. If a ∉ Y, then a is either a lower bound on Y
or an upper bound on Y, since Y is convex. In the former case, the set (a, +∞) ∩ Y
equals all of Y; in the latter case, it is empty.

A similar remark shows that the intersection of the ray (−∞, a) with Y is either
an open ray of Y, or Y itself, or empty. Since the sets (a, +∞) ∩ Y and (−∞, a) ∩ Y
form a subbasis for the subspace topology on Y, and since each is open in the order
topology, the order topology contains the subspace topology.

To prove the reverse, note that any open ray of Y equals the intersection of an open
ray of X with Y, so it is open in the subspace topology on Y. Since the open rays of Y
are a subbasis for the order topology on Y, this topology is contained in the subspace
topology. ∎


To avoid ambiguity, let us agree that whenever X is an ordered set in the order 
topology and Y is a subset of X, we shall assume that Y is given the subspace topology 
unless we specifically state otherwise. If Y is convex in X, this is the same as the order 
topology on Y; otherwise, it may not be.


Exercises 


1. Show that if Y is a subspace of X, and A is a subset of Y, then the topology A
inherits as a subspace of Y is the same as the topology it inherits as a subspace
of X.

2. If 𝒯 and 𝒯′ are topologies on X and 𝒯′ is strictly finer than 𝒯, what can you
say about the corresponding subspace topologies on the subset Y of X?


3. Consider the set Y = [−1, 1] as a subspace of R. Which of the following sets
are open in Y? Which are open in R?

A = {x | 1/2 < |x| < 1},

B = {x | 1/2 < |x| ≤ 1},

C = {x | 1/2 ≤ |x| < 1},

D = {x | 1/2 ≤ |x| ≤ 1},

E = {x | 0 < |x| < 1 and 1/x ∉ Z₊}.

4. A map f : X → Y is said to be an open map if for every open set U of X, the
set f(U) is open in Y. Show that π₁ : X × Y → X and π₂ : X × Y → Y are
open maps.

5. Let X and X′ denote a single set in the topologies 𝒯 and 𝒯′, respectively; let Y
and Y′ denote a single set in the topologies 𝒰 and 𝒰′, respectively. Assume
these sets are nonempty.
(a) Show that if 𝒯′ ⊃ 𝒯 and 𝒰′ ⊃ 𝒰, then the product topology on X′ × Y′ is
finer than the product topology on X × Y.
(b) Does the converse of (a) hold? Justify your answer.


6. Show that the countable collection

{(a, b) × (c, d) | a < b and c < d, and a, b, c, d are rational}

is a basis for R².

7. Let X be an ordered set. If Y is a proper subset of X that is convex in X, does it
follow that Y is an interval or a ray in X?

8. If L is a straight line in the plane, describe the topology L inherits as a subspace
of R_ℓ × R and as a subspace of R_ℓ × R_ℓ. In each case it is a familiar topology.

9. Show that the dictionary order topology on the set R × R is the same as the
product topology R_d × R, where R_d denotes R in the discrete topology. Compare
this topology with the standard topology on R².

10. Let I = [0, 1]. Compare the product topology on I × I, the dictionary order
topology on I × I, and the topology I × I inherits as a subspace of R × R in the
dictionary order topology.


§17 Closed Sets and Limit Points 


Now that we have a few examples at hand, we can introduce some of the basic concepts
associated with topological spaces. In this section, we treat the notions of closed set,
closure of a set, and limit point. These lead naturally to consideration of a certain
axiom for topological spaces called the Hausdorff axiom.


Closed Sets 
A subset A of a topological space X is said to be closed if the set X — A is open. 
EXAMPLE 1. The subset [a, b] of R is closed because its complement

R − [a, b] = (−∞, a) ∪ (b, +∞)

is open. Similarly, [a, +∞) is closed, because its complement (−∞, a) is open. These
facts justify our use of the terms “closed interval” and “closed ray.” The subset [a, b) of R
is neither open nor closed.


EXAMPLE 2. In the plane R², the set

{x × y | x ≥ 0 and y ≥ 0}

is closed, because its complement is the union of the two sets

(−∞, 0) × R   and   R × (−∞, 0),


each of which is a product of open sets of R and is, therefore, open in R².


EXAMPLE 3. In the finite complement topology on a set X, the closed sets consist of X
itself and all finite subsets of X.


EXAMPLE 4 In the discrete topology on the set X, every set is open; it follows that 
every set is closed as well. 


EXAMPLE 5. Consider the following subset of the real line:

Y = [0, 1] ∪ (2, 3),


in the subspace topology. In this space, the set [0, 1] is open, since it is the intersection of
the open set (−1/2, 3/2) of R with Y. Similarly, (2, 3) is open as a subset of Y; it is even open
as a subset of R. Since [0, 1] and (2, 3) are complements in Y of each other, we conclude
that both [0, 1] and (2, 3) are closed as subsets of Y.


These examples suggest that an answer to the mathematician’s riddle: “How is 
a set different from a door?” should be: “A door must be either open or closed, and 
cannot be both, while a set can be open, or closed, or both, or neither!” 

The collection of closed subsets of a space X has properties similar to those satis- 
fied by the collection of open subsets of X: 


94 Topological Spaces and Continuous Functions Ch. 2 


Theorem 17.1. Let X be a topological space. Then the following conditions hold:
(1) ∅ and X are closed.
(2) Arbitrary intersections of closed sets are closed.
(3) Finite unions of closed sets are closed.


Proof. (1) ∅ and X are closed because they are the complements of the open sets X
and ∅, respectively.
(2) Given a collection of closed sets {A_α}_{α∈J}, we apply DeMorgan’s law,


X − ⋂_{α∈J} A_α = ⋃_{α∈J} (X − A_α).


Since the sets X − A_α are open by definition, the right side of this equation represents
an arbitrary union of open sets, and is thus open. Therefore, ⋂ A_α is closed.
(3) Similarly, if A_i is closed for i = 1, ..., n, consider the equation


X − ⋃_{i=1}^{n} A_i = ⋂_{i=1}^{n} (X − A_i).


The set on the right side of this equation is a finite intersection of open sets and is
therefore open. Hence ⋃ A_i is closed. ∎


Instead of using open sets, one could just as well specify a topology on a space by 
giving a collection of sets (to be called “closed sets”) satisfying the three properties of 
this theorem. One could then define open sets as the complements of closed sets and 
proceed just as before. This procedure has no particular advantage over the one we 
have adopted, and most mathematicians prefer to use open sets to define topologies. 

Now when dealing with subspaces, one needs to be careful in using the term 
“closed set.” If Y is a subspace of X, we say that a set A is closed in Y if A is a
subset of Y and if A is closed in the subspace topology of Y (that is, if Y — A is open 
in Y). We have the following theorem: 


Theorem 17.2. Let Y be a subspace of X. Then a set A is closed in Y if and only if 
it equals the intersection of a closed set of X with Y. 


Proof. Assume that A = C ∩ Y, where C is closed in X. (See Figure 17.1.) Then
X − C is open in X, so that (X − C) ∩ Y is open in Y, by definition of the subspace
topology. But (X − C) ∩ Y = Y − A. Hence Y − A is open in Y, so that A is closed in
Y. Conversely, assume that A is closed in Y. (See Figure 17.2.) Then Y − A is open
in Y, so that by definition it equals the intersection of an open set U of X with Y. The
set X − U is closed in X, and A = Y ∩ (X − U), so that A equals the intersection of
a closed set of X with Y, as desired. ∎


A set A that is closed in the subspace Y may or may not be closed in the larger
space X. As was the case with open sets, there is a criterion for A to be closed in X;
we leave the proof to you:

Figure 17.1    Figure 17.2


Theorem 17.3. Let Y be a subspace of X. If A is closed in Y and Y is closed in X, 
then A is closed in X. 


Closure and Interior of a Set 


Given a subset A of a topological space X, the interior of A is defined as the union of 
all open sets contained in A, and the closure of A is defined as the intersection of all 
closed sets containing A. 

The interior of A is denoted by Int A and the closure of A is denoted by Cl A or
by Ā. Obviously Int A is an open set and Ā is a closed set; furthermore,


Int A ⊂ A ⊂ Ā.


If A is open, A = Int A; while if A is closed, A = Ā.

We shall not make much use of the interior of a set, but the closure of a set will be
quite important.

When dealing with a topological space X and a subspace Y, one needs to exercise
care in taking closures of sets. If A is a subset of Y, the closure of A in Y and the
closure of A in X will in general be different. In such a situation, we reserve the
notation Ā to stand for the closure of A in X. The closure of A in Y can be expressed
in terms of Ā, as the following theorem shows:


Theorem 17.4. Let Y be a subspace of X; let A be a subset of Y; let Ā denote the
closure of A in X. Then the closure of A in Y equals Ā ∩ Y.


Proof. Let B denote the closure of A in Y. The set Ā is closed in X, so Ā ∩ Y is
closed in Y by Theorem 17.2. Since Ā ∩ Y contains A, and since by definition B equals
the intersection of all closed subsets of Y containing A, we must have B ⊂ (Ā ∩ Y).

On the other hand, we know that B is closed in Y. Hence by Theorem 17.2,
B = C ∩ Y for some set C closed in X. Then C is a closed set of X containing A;
because Ā is the intersection of all such closed sets, we conclude that Ā ⊂ C. Then
(Ā ∩ Y) ⊂ (C ∩ Y) = B. ∎

The definition of the closure of a set does not give us a convenient way for actually
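
On a finite space the closure can be computed directly from its definition, which gives a quick check of Theorem 17.4. The Python sketch below is an illustration under our own assumptions (the four-point space, the subspace Y, the set A, and the function name closure are not from the text). It computes the closure of A in X and in Y as intersections of closed sets and compares the latter with Ā ∩ Y.

```python
def closure(A, T, X):
    """Closure of A: intersection of all closed sets (complements of open sets) containing A."""
    result = X
    for U in T:
        C = X - U              # C is closed because U is open
        if A <= C:
            result = result & C
    return result

# Assumed example: a nested topology on X = {1, 2, 3, 4}.
X = frozenset({1, 2, 3, 4})
T = {frozenset(), frozenset({1}), frozenset({1, 2}), frozenset({1, 2, 3}), X}

Y = frozenset({1, 2, 3})
TY = {U & Y for U in T}        # subspace topology on Y
A = frozenset({2})

cl_X = closure(A, T, X)        # closure of A in X
cl_Y = closure(A, TY, Y)       # closure of A in Y

print(sorted(cl_X), sorted(cl_Y), cl_Y == cl_X & Y)
```

Here the closure of A in X is {2, 3, 4} while its closure in Y is {2, 3}, so the two closures differ, and the second is obtained from the first by intersecting with Y.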
finding the closures of specific sets, since the collection of all closed sets in X, like 
the collection of all open sets, is usually much too big to work with. Another way of 
describing the closure of a set, useful because it involves only a basis for the topology 
of X, is given in the following theorem. 

First let us introduce some convenient terminology. We shall say that a set A 
intersects a set B if the intersection A N B is not empty. 


Theorem 17.5. Let A be a subset of the topological space X.
(a) Then x ∈ Ā if and only if every open set U containing x intersects A.
(b) Supposing the topology of X is given by a basis, then x ∈ Ā if and only if every
basis element B containing x intersects A.


Proof. Consider the statement in (a). It is a statement of the form P ⇔ Q. Let
us transform each implication to its contrapositive, thereby obtaining the logically
equivalent statement (not P) ⇔ (not Q). Written out, it is the following:


x ∉ Ā ⇔ there exists an open set U containing x that does not intersect A.


In this form, our theorem is easy to prove. If x is not in Ā, the set U = X − Ā is an
open set containing x that does not intersect A, as desired. Conversely, if there exists
an open set U containing x which does not intersect A, then X − U is a closed set
containing A. By definition of the closure Ā, the set X − U must contain Ā; therefore,
x cannot be in Ā.

Statement (b) follows readily. If every open set containing x intersects A, so does
every basis element B containing x, because B is an open set. Conversely, if every
basis element containing x intersects A, so does every open set U containing x,
because U contains a basis element that contains x. ∎
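
Both parts of the theorem can be tested exhaustively on a finite space. In the Python sketch below, the three-point space, its basis, and the function names are assumptions made for illustration. The closure is computed once from the definition (intersection of closed sets) and once from the neighborhood criterion, first using all open sets and then using basis elements only.

```python
from itertools import combinations

def closure(A, T, X):
    """Closure by definition: intersection of all closed sets containing A."""
    result = X
    for U in T:
        if A <= X - U:
            result = result & (X - U)
    return result

def closure_by_neighborhoods(A, opens, X):
    """Points x such that every member of `opens` containing x intersects A."""
    return frozenset(x for x in X if all(U & A for U in opens if x in U))

# Assumed example: X = {a, b, c} with a three-element basis.
X = frozenset("abc")
basis = {frozenset("b"), frozenset("ab"), frozenset("bc")}
T = basis | {frozenset(), X}     # the topology this basis generates

subsets = [frozenset(s) for r in range(4) for s in combinations(sorted(X), r)]

part_a = all(closure(A, T, X) == closure_by_neighborhoods(A, T, X) for A in subsets)
part_b = all(closure(A, T, X) == closure_by_neighborhoods(A, basis, X) for A in subsets)
print(part_a, part_b)
```

The criterion of part (b) is the one used in practice, since a basis is usually far smaller than the full collection of open sets.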


Mathematicians often use some special terminology here. They shorten the state- 
ment “U is an open set containing x” to the phrase 


“U is a neighborhood of x.” 
Using this terminology, one can write the first half of the preceding theorem as follows: 


If A is a subset of the topological space X, then x ∈ Ā if and only if every
neighborhood of x intersects A.


EXAMPLE 6. Let X be the real line R. If A = (0, 1], then Ā = [0, 1], for every
neighborhood of 0 intersects A, while every point outside [0, 1] has a neighborhood disjoint
from A. Similar arguments apply to the following subsets of X:

If B = {1/n | n ∈ Z₊}, then B̄ = {0} ∪ B. If C = {0} ∪ (1, 2), then C̄ = {0} ∪ [1, 2].
If Q is the set of rational numbers, then Q̄ = R. If Z₊ is the set of positive integers, then
Z̄₊ = Z₊. If R₊ is the set of positive reals, then the closure of R₊ is the set R₊ ∪ {0}.
(This is the reason we introduced the notation R̄₊ for the set R₊ ∪ {0}, back in §2.)

EXAMPLE 7. Consider the subspace Y = (0, 1] of the real line R. The set A = (0, 1/2) is
a subset of Y; its closure in R is the set [0, 1/2], and its closure in Y is the set [0, 1/2] ∩ Y =
(0, 1/2].


Some mathematicians use the term “neighborhood” differently. They say that A 
is a neighborhood of x if A merely contains an open set containing x. We shall not 
follow this practice. 


Limit Points 


There is yet another way of describing the closure of a set, a way that involves the 
important concept of limit point, which we consider now. 

If A is a subset of the topological space X and if x is a point of X, we say that x is a 
limit point (or “cluster point,” or “point of accumulation”) of A if every neighborhood 
of x intersects A in some point other than x itself. Said differently, x is a limit point 
of A if it belongs to the closure of A − {x}. The point x may lie in A or not; for this
definition it does not matter. 


EXAMPLE 8. Consider the real line R. If A = (0, 1], then the point 0 is a limit point
of A and so is the point 1/2. In fact, every point of the interval [0, 1] is a limit point of A, but
no other point of R is a limit point of A.

If B = {1/n | n ∈ Z₊}, then 0 is the only limit point of B. Every other point x of R has
a neighborhood that either does not intersect B at all, or intersects B only in the point x
itself. If C = {0} ∪ (1, 2), then the limit points of C are the points of the interval [1, 2]. If
Q is the set of rational numbers, every point of R is a limit point of Q. If Z₊ is the set of
positive integers, no point of R is a limit point of Z₊. If R₊ is the set of positive reals, then
every point of {0} ∪ R₊ is a limit point of R₊.


Comparison of Examples 6 and 8 suggests a relationship between the closure of a 
set and the limit points of a set. That relationship is given in the following theorem: 


Theorem 17.6. Let A be a subset of the topological space X; let A′ be the set of all
limit points of A. Then


Ā = A ∪ A′.


Proof. If x is in A′, every neighborhood of x intersects A (in a point different from x).
Therefore, by Theorem 17.5, x belongs to Ā. Hence A′ ⊂ Ā. Since by definition
A ⊂ Ā, it follows that A ∪ A′ ⊂ Ā.

To demonstrate the reverse inclusion, we let x be a point of Ā and show that
x ∈ A ∪ A′. If x happens to lie in A, it is trivial that x ∈ A ∪ A′; suppose that x
does not lie in A. Since x ∈ Ā, we know that every neighborhood U of x intersects A;
because x ∉ A, the set U must intersect A in a point different from x. Then x ∈ A′,
so that x ∈ A ∪ A′, as desired. ∎

Corollary 17.7. A subset of a topological space is closed if and only if it contains all
its limit points.


Proof. The set A is closed if and only if A = Ā, and the latter holds if and only if
A′ ⊂ A. ∎
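
Theorem 17.6 and its corollary can likewise be confirmed by exhaustive search on a finite space. The Python sketch below is only an illustration; the three-point space and the function names closure and limit_points are our assumptions. It computes the set A′ of limit points of every subset A and compares A ∪ A′ with the closure computed from the definition.

```python
from itertools import combinations

def closure(A, T, X):
    """Closure by definition: intersection of all closed sets containing A."""
    result = X
    for U in T:
        if A <= X - U:
            result = result & (X - U)
    return result

def limit_points(A, T, X):
    """Points x whose every neighborhood meets A in a point other than x."""
    return frozenset(x for x in X if all(U & (A - {x}) for U in T if x in U))

# Assumed example: a five-element topology on X = {a, b, c}.
X = frozenset("abc")
T = {frozenset(), frozenset("b"), frozenset("ab"), frozenset("bc"), X}
subsets = [frozenset(s) for r in range(4) for s in combinations(sorted(X), r)]

# Theorem 17.6: the closure of A equals A together with its limit points.
thm_17_6 = all(closure(A, T, X) == A | limit_points(A, T, X) for A in subsets)
# Corollary 17.7: A is closed exactly when it contains all its limit points.
cor_17_7 = all((X - A in T) == (limit_points(A, T, X) <= A) for A in subsets)
print(thm_17_6, cor_17_7)
```

In this space the one-point set {b} has a and c as limit points, which is why its closure is all of X.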


Hausdorff Spaces 


One’s experience with open and closed sets and limit points in the real line and the 
plane can be misleading when one considers more general topological spaces. For 
example, in the spaces R and R², each one-point set {x₀} is closed. This fact is easily
proved; every point different from x₀ has a neighborhood not intersecting {x₀}, so
that {x₀} is its own closure. But this fact is not true for arbitrary topological spaces.
Consider the topology on the three-point set {a, b, c} indicated in Figure 17.3. In this 
space, the one-point set {b} is not closed, for its complement is not open. 


Figure 17.3


Similarly, one’s experience with the properties of convergent sequences in R and
R² can be misleading when one deals with more general topological spaces. In an
arbitrary topological space, one says that a sequence x₁, x₂, ... of points of the space
X converges to the point x of X provided that, corresponding to each neighborhood U
of x, there is a positive integer N such that xₙ ∈ U for all n ≥ N. In R and R², a
sequence cannot converge to more than one point, but in an arbitrary space, it can. In
the space indicated in Figure 17.3, for example, the sequence defined by setting xₙ = b
for all n converges not only to the point b, but also to the point a and to the point c!
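
The behavior of this constant sequence can be checked by direct computation. The Python sketch below rests on an assumption that should be stated plainly: the figure itself is not reproduced here, and we take the topology of Figure 17.3 to consist of ∅, {b}, {a, b}, {b, c}, and X, which is consistent with the two properties the text attributes to it. The function name is ours. A constant sequence with value v converges to p exactly when every neighborhood of p contains v.

```python
def constant_sequence_limits(value, T, X):
    """Points p such that the constant sequence value, value, ... converges to p."""
    return {p for p in X if all(value in U for U in T if p in U)}

# Assumed topology for the three-point space of Figure 17.3.
X = frozenset("abc")
T = {frozenset(), frozenset("b"), frozenset("ab"), frozenset("bc"), X}

b_is_closed = (X - frozenset("b")) in T          # is {b} closed?
limits_of_b = constant_sequence_limits("b", T, X)
limits_of_a = constant_sequence_limits("a", T, X)

print(b_is_closed, sorted(limits_of_b), sorted(limits_of_a))
```

Under this assumption the set {b} is not closed, the constant sequence at b converges to all three points, and the constant sequence at a converges only to a.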

Topologies in which one-point sets are not closed, or in which sequences can con- 
verge to more than one point, are considered by many mathematicians to be somewhat 
strange. They are not really very interesting, for they seldom occur in other branches 
of mathematics. And the theorems that one can prove about topological spaces are
rather limited if such examples are allowed. Therefore, one often imposes an addi- 
tional condition that will rule out examples like this one, bringing the class of spaces 
under consideration closer to those to which one’s geometric intuition applies. The 
condition was suggested by the mathematician Felix Hausdorff, so mathematicians 
have come to call it by his name. 


Definition. A topological space X is called a Hausdorff space if for each pair x₁, x₂
of distinct points of X, there exist neighborhoods U₁ and U₂ of x₁ and x₂, respectively,
that are disjoint.

Theorem 17.8. Every finite point set in a Hausdorff space X is closed.


Proof. It suffices to show that every one-point set {x₀} is closed. If x is a point of X
different from x₀, then x and x₀ have disjoint neighborhoods U and V, respectively.
Since U does not intersect {x₀}, the point x cannot belong to the closure of the set {x₀}.
As a result, the closure of the set {x₀} is {x₀} itself, so that it is closed. ∎
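
For finite spaces both conditions are easy to test by machine. The Python sketch below is illustrative, with assumed examples and function names of our own: it checks the Hausdorff condition pair by pair and checks whether one-point sets are closed, first for the discrete topology on a three-point set and then for a non-discrete topology on the same set.

```python
from itertools import combinations

def is_hausdorff(T, X):
    """True if every two distinct points have disjoint neighborhoods."""
    return all(
        any(x in U and y in V and not (U & V) for U in T for V in T)
        for x, y in combinations(sorted(X), 2)
    )

def one_point_sets_closed(T, X):
    """True if the complement of every one-point set is open."""
    return all(X - {x} in T for x in X)

X = frozenset("abc")
# Assumed examples: the discrete topology, and a coarser five-element topology.
discrete = {frozenset(s) for r in range(4) for s in combinations(sorted(X), r)}
coarse = {frozenset(), frozenset("b"), frozenset("ab"), frozenset("bc"), X}

print(is_hausdorff(discrete, X), one_point_sets_closed(discrete, X))
print(is_hausdorff(coarse, X), one_point_sets_closed(coarse, X))
```

The discrete space passes both tests and the coarser one fails both. A finite example cannot separate the two conditions, since a finite space in which one-point sets are closed is necessarily discrete; an infinite space is needed for that.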


The condition that finite point sets be closed is in fact weaker than the Hausdorff
condition. For example, the real line R in the finite complement topology is not a
Hausdorff space, but it is a space in which finite point sets are closed. The condition
that finite point sets be closed has been given a name of its own: it is called the T₁ axiom.
(We shall explain the reason for this strange terminology in Chapter 4.) The
T₁ axiom will appear in this book in a few exercises, and in just one theorem, which is
the following:


Theorem 17.9. Let X be a space satisfying the T₁ axiom; let A be a subset of X.
Then the point x is a limit point of A if and only if every neighborhood of x contains
infinitely many points of A.


Proof. If every neighborhood of x intersects A in infinitely many points, it certainly
intersects A in some point other than x itself, so that x is a limit point of A.

Conversely, suppose that x is a limit point of A, and suppose some neighborhood U
of x intersects A in only finitely many points. Then U also intersects A − {x}
in finitely many points; let {x₁, ..., xₘ} be the points of U ∩ (A − {x}). The set
X − {x₁, ..., xₘ} is an open set of X, since the finite point set {x₁, ..., xₘ} is closed;
then


U ∩ (X − {x₁, ..., xₘ})


is a neighborhood of x that intersects the set A − {x} not at all. This contradicts the
assumption that x is a limit point of A. ∎


One reason for our lack of interest in the $T_1$ axiom is the fact that many of the
interesting theorems of topology require not just that axiom, but the full strength of 
the Hausdorff axiom. Furthermore, most of the spaces that are important to mathe- 
maticians are Hausdorff spaces. The following two theorems give some substance to 
these remarks. 


Theorem 17.10. If X is a Hausdorff space, then a sequence of points of X converges
to at most one point of X.


Proof. Suppose that $x_n$ is a sequence of points of X that converges to x. If $y \neq x$,
let U and V be disjoint neighborhoods of x and y, respectively. Since U contains $x_n$
for all but finitely many values of n, the set V cannot. Therefore, $x_n$ cannot converge
to y. ∎

If the sequence $x_n$ of points of the Hausdorff space X converges to the point x
of X, we often write $x_n \to x$, and we say that x is the limit of the sequence $x_n$.
The proof of the following result is left to the exercises.


Theorem 17.11. Every simply ordered set is a Hausdorff space in the order topology. 
The product of two Hausdorff spaces is a Hausdorff space. A subspace of a Hausdorff 
space is a Hausdorff space. 


The Hausdorff condition is generally considered to be a very mild extra condition 
to impose on a topological space. Indeed, in a first course in topology some mathe- 
maticians go so far as to impose this condition at the outset, refusing to consider spaces 
that are not Hausdorff spaces. We shall not go this far, but we shall certainly assume 
the Hausdorff condition whenever it is needed in a proof without having any qualms 
about limiting seriously the range of applications of the results. 

The Hausdorff condition is one of a number of extra conditions one can impose on 
a topological space. Each time one imposes such a condition, one can prove stronger 
theorems, but one limits the class of spaces to which the theorems apply. Much of the 
research that has been done in topology since its beginnings has centered on the prob- 
lem of finding conditions that will be strong enough to enable one to prove interesting 
theorems about spaces satisfying those conditions, and yet not so strong that they limit 
severely the range of applications of the results. 

We shall study a number of such conditions in the next two chapters. The Hausdorff
condition and the $T_1$ axiom are but two of a collection of conditions similar to one
another that are called collectively the separation axioms. Other conditions include the
countability axioms, and various compactness and connectedness conditions. Some of 
these are quite stringent requirements, as you will see. 


Exercises 


1. Let $\mathcal{C}$ be a collection of subsets of the set X. Suppose that ∅ and X are in $\mathcal{C}$,
and that finite unions and arbitrary intersections of elements of $\mathcal{C}$ are in $\mathcal{C}$. Show
that the collection


$$\mathcal{T} = \{X - C \mid C \in \mathcal{C}\}$$


is a topology on X. 

2. Show that if A is closed in Y and Y is closed in X, then A is closed in X. 

3. Show that if A is closed in X and B is closed in Y, then A x B is closed in X x Y. 

4. Show that if U is open in X and A is closed in X, then U — A is open in X, and 
A — U is closed in X. 

5. Let X be an ordered set in the order topology. Show that $\overline{(a, b)} \subset [a, b]$. Under
what conditions does equality hold? 


6. Let A, B, and $A_\alpha$ denote subsets of a space X. Prove the following:
(a) If $A \subset B$, then $\bar{A} \subset \bar{B}$.
(b) $\overline{A \cup B} = \bar{A} \cup \bar{B}$.
(c) $\overline{\bigcup A_\alpha} \supset \bigcup \bar{A}_\alpha$; give an example where equality fails.


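7. Criticize the following “proof” that $\overline{\bigcup A_\alpha} \subset \bigcup \bar{A}_\alpha$: if $\{A_\alpha\}$ is a collection of
sets in X and if $x \in \overline{\bigcup A_\alpha}$, then every neighborhood U of x intersects $\bigcup A_\alpha$.
Thus U must intersect some $A_\alpha$, so that x must belong to the closure of some $A_\alpha$.
Therefore, $x \in \bigcup \bar{A}_\alpha$.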


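8. Let A, B, and $A_\alpha$ denote subsets of a space X. Determine whether the following
equations hold; if an equality fails, determine whether one of the inclusions ⊃
or ⊂ holds.
(a) $\overline{A \cap B} = \bar{A} \cap \bar{B}$.
(b) $\overline{\bigcap A_\alpha} = \bigcap \bar{A}_\alpha$.
(c) $\overline{A - B} = \bar{A} - \bar{B}$.

9. Let $A \subset X$ and $B \subset Y$. Show that in the space X × Y,


$$\overline{A \times B} = \bar{A} \times \bar{B}.$$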


10. Show that every order topology is Hausdorff.

11. Show that the product of two Hausdorff spaces is Hausdorff.

12. Show that a subspace of a Hausdorff space is Hausdorff.

13. Show that X is Hausdorff if and only if the diagonal $\Delta = \{x \times x \mid x \in X\}$ is
closed in X × X.


14. In the finite complement topology on R, to what point or points does the sequence
$x_n = 1/n$ converge?

15. Show the $T_1$ axiom is equivalent to the condition that for each pair of points of X,
each has a neighborhood not containing the other.


16. Consider the five topologies on R given in Exercise 7 of §13.
(a) Determine the closure of the set $K = \{1/n \mid n \in \mathbb{Z}_+\}$ under each of these
topologies.
(b) Which of these topologies satisfy the Hausdorff axiom? the $T_1$ axiom?

17. Consider the lower limit topology on R and the topology given by the basis $\mathcal{C}$
of Exercise 8 of §13. Determine the closures of the intervals $A = (0, \sqrt{2})$ and
$B = (\sqrt{2}, 3)$ in these two topologies.

18. Determine the closures of the following subsets of the ordered square:
$A = \{(1/n) \times 0 \mid n \in \mathbb{Z}_+\}$,
$B = \{(1 - 1/n) \times \tfrac{1}{2} \mid n \in \mathbb{Z}_+\}$,
$C = \{x \times 0 \mid 0 < x < 1\}$,
$D = \{x \times \tfrac{1}{2} \mid 0 < x < 1\}$,
$E = \{\tfrac{1}{2} \times y \mid 0 < y < 1\}$.

19. If $A \subset X$, we define the boundary of A by the equation

$$\operatorname{Bd} A = \bar{A} \cap \overline{(X - A)}.$$

(a) Show that Int A and Bd A are disjoint, and $\bar{A} = \operatorname{Int} A \cup \operatorname{Bd} A$.

(b) Show that Bd A = ∅ ⇔ A is both open and closed.

(c) Show that U is open ⇔ $\operatorname{Bd} U = \bar{U} - U$.

(d) If U is open, is it true that $U = \operatorname{Int}(\bar{U})$? Justify your answer.
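20. Find the boundary and the interior of each of the following subsets of $\mathbb{R}^2$:

(a) $A = \{x \times y \mid y = 0\}$

(b) $B = \{x \times y \mid x > 0 \text{ and } y \neq 0\}$

(c) $C = A \cup B$

(d) $D = \{x \times y \mid x \text{ is rational}\}$

(e) $E = \{x \times y \mid 0 < x^2 - y^2 < 1\}$

(f) $F = \{x \times y \mid x \neq 0 \text{ and } y < 1/x\}$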

*21. (Kuratowski) Consider the collection of all subsets A of the topological space X.
The operations of closure $A \to \bar{A}$ and complementation $A \to X - A$ are functions
from this collection to itself.

(a) Show that starting with a given set A, one can form no more than 14 distinct
sets by applying these two operations successively.

(b) Find a subset A of R (in its usual topology) for which the maximum of 14 is
obtained.


§18 Continuous Functions 


The concept of continuous function is basic to much of mathematics. Continuous 
functions on the real line appear in the first pages of any calculus book, and continuous 
functions in the plane and in space follow not far behind. More general kinds of 
continuous functions arise as one goes further in mathematics. In this section, we shall 
formulate a definition of continuity that will include all these as special cases, and we 
shall study various properties of continuous functions. Many of these properties are 
direct generalizations of things you learned about continuous functions in calculus and 
analysis. 


Continuity of a Function 


Let X and Y be topological spaces. A function $f : X \to Y$ is said to be continuous if
for each open subset V of Y, the set $f^{-1}(V)$ is an open subset of X.

Recall that $f^{-1}(V)$ is the set of all points x of X for which $f(x) \in V$; it is empty
if V does not intersect the image set f(X) of f.
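
When X and Y are finite, the definition can be tested directly by computing every inverse image. The following Python sketch is illustrative only; the names `preimage` and `is_continuous`, the three-point space X, the two-point space Y, and the maps `f` and `g` are hypothetical choices, with each function stored as a dictionary.

```python
def preimage(f, V, domain):
    """f^{-1}(V): all points x of the domain with f(x) in V."""
    return frozenset(x for x in domain if f[x] in V)

def is_continuous(f, X, TX, TY):
    """Open-set definition: the inverse image of every open set of Y is open in X."""
    return all(preimage(f, V, X) in TX for V in TY)

# Hypothetical finite spaces; TX and TY list the open sets.
X = {"a", "b", "c"}
TX = {frozenset(), frozenset({"a"}), frozenset({"a", "b"}), frozenset(X)}
Y = {0, 1}
TY = {frozenset(), frozenset({1}), frozenset(Y)}

f = {"a": 1, "b": 1, "c": 0}   # inverse image of {1} is {a, b}, which is open
g = {"a": 0, "b": 1, "c": 1}   # inverse image of {1} is {b, c}, which is not open

f_continuous = is_continuous(f, X, TX, TY)
g_continuous = is_continuous(g, X, TX, TY)

# A set that misses the image of f has empty inverse image.
empty_preimage = preimage(f, frozenset({2}), X)
```

Here `f` passes the test and `g` fails it, although both are maps between the same two sets; only the chosen topologies distinguish them.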

Continuity of a function depends not only upon the function f itself, but also on 
the topologies specified for its domain and range. If we wish to emphasize this fact, 
we can say that f is continuous relative to specific topologies on X and Y.

Let us note that if the topology of the range space Y is given by a basis $\mathcal{B}$, then to
prove continuity of f it suffices to show that the inverse image of every basis element
is open. The arbitrary open set V of Y can be written as a union of basis elements


$$V = \bigcup_{\alpha \in J} B_\alpha.$$


Then


$$f^{-1}(V) = \bigcup_{\alpha \in J} f^{-1}(B_\alpha),$$


so that $f^{-1}(V)$ is open if each set $f^{-1}(B_\alpha)$ is open.

If the topology on Y is given by a subbasis $\mathcal{S}$, to prove continuity of f it will even
suffice to show that the inverse image of each subbasis element is open. The arbitrary
basis element B for Y can be written as a finite intersection $S_1 \cap \cdots \cap S_n$ of subbasis
elements; it follows from the equation


$$f^{-1}(B) = f^{-1}(S_1) \cap \cdots \cap f^{-1}(S_n)$$


that the inverse image of every basis element is open.


EXAMPLE 1. Let us consider a function like those studied in analysis, a “real-valued
function of a real variable,”


$$f : \mathbb{R} \to \mathbb{R}.$$


In analysis, one defines continuity of f via the “ε-δ definition,” a bugaboo over the years
for every student of mathematics. As one would expect, the ε-δ definition and ours are
equivalent. To prove that our definition implies the ε-δ definition, for instance, we proceed
as follows:

Given $x_0$ in R, and given ε > 0, the interval $V = (f(x_0) - \epsilon, f(x_0) + \epsilon)$ is an open set
of the range space R. Therefore, $f^{-1}(V)$ is an open set in the domain space R. Because
$f^{-1}(V)$ contains the point $x_0$, it contains some basis element (a, b) about $x_0$. We choose δ
to be the smaller of the two numbers $x_0 - a$ and $b - x_0$. Then if $|x - x_0| < \delta$, the point x
must be in (a, b), so that $f(x) \in V$, and $|f(x) - f(x_0)| < \epsilon$, as desired.

Proving that the ε-δ definition implies our definition is no harder; we leave it to you.
We shall return to this example when we study metric spaces.


EXAMPLE 2. In calculus one considers the property of continuity for many kinds of
functions. For example, one studies functions of the following types:

$f : \mathbb{R} \to \mathbb{R}^2$ (curves in the plane)

$f : \mathbb{R} \to \mathbb{R}^3$ (curves in space)

$f : \mathbb{R}^2 \to \mathbb{R}$ (functions f(x, y) of two real variables)

$f : \mathbb{R}^3 \to \mathbb{R}$ (functions f(x, y, z) of three real variables)

$f : \mathbb{R}^2 \to \mathbb{R}^2$ (vector fields v(x, y) in the plane).


Each of them has a notion of continuity defined for it. Our general definition of continuity
includes all these as special cases; this fact will be a consequence of general theorems we
shall prove concerning continuous functions on product spaces and on metric spaces.


EXAMPLE 3. Let $\mathbb{R}$ denote the set of real numbers in its usual topology, and let $\mathbb{R}_\ell$
denote the same set in the lower limit topology. Let


$$f : \mathbb{R} \to \mathbb{R}_\ell$$


be the identity function; f(x) = x for every real number x. Then f is not a continuous
function; the inverse image of the open set [a, b) of $\mathbb{R}_\ell$ equals itself, which is not open
in $\mathbb{R}$. On the other hand, the identity function


$$g : \mathbb{R}_\ell \to \mathbb{R}$$

is continuous, because the inverse image of (a, b) is itself, which is open in $\mathbb{R}_\ell$.


In analysis, one studies several different but equivalent ways of formulating the 
definition of continuity. Some of these generalize to arbitrary spaces, and they are 
considered in the theorems that follow. The familiar “ε-δ” definition and the “convergent
sequence definition” do not generalize to arbitrary spaces; they will be treated
when we study metric spaces. 


Theorem 18.1. Let X and Y be topological spaces; let $f : X \to Y$. Then the
following are equivalent:
(1) f is continuous.
(2) For every subset A of X, one has $f(\bar{A}) \subset \overline{f(A)}$.
(3) For every closed set B of Y, the set $f^{-1}(B)$ is closed in X.
(4) For each $x \in X$ and each neighborhood V of f(x), there is a neighborhood U
of x such that $f(U) \subset V$.


If the condition in (4) holds for the point x of X, we say that f is continuous at 
the point x. 
Proof. We show that (1) ⇒ (2) ⇒ (3) ⇒ (1) and that (1) ⇒ (4) ⇒ (1).

(1) ⇒ (2). Assume that f is continuous. Let A be a subset of X. We show that if
$x \in \bar{A}$, then $f(x) \in \overline{f(A)}$. Let V be a neighborhood of f(x). Then $f^{-1}(V)$ is an open
set of X containing x; it must intersect A in some point y. Then V intersects f(A) in
the point f(y), so that $f(x) \in \overline{f(A)}$, as desired.

(2) ⇒ (3). Let B be closed in Y and let $A = f^{-1}(B)$. We wish to prove that A
is closed in X; we show that $\bar{A} = A$. By elementary set theory, we have $f(A) =
f(f^{-1}(B)) \subset B$. Therefore, if $x \in \bar{A}$,


$$f(x) \in f(\bar{A}) \subset \overline{f(A)} \subset \bar{B} = B,$$


so that $x \in f^{-1}(B) = A$. Thus $\bar{A} \subset A$, so that $\bar{A} = A$, as desired.

(3) ⇒ (1). Let V be an open set of Y. Set B = Y − V. Then


$$f^{-1}(B) = f^{-1}(Y) - f^{-1}(V) = X - f^{-1}(V).$$


Now B is a closed set of Y. Then $f^{-1}(B)$ is closed in X by hypothesis, so that $f^{-1}(V)$
is open in X, as desired.

(1) ⇒ (4). Let $x \in X$ and let V be a neighborhood of f(x). Then the set
$U = f^{-1}(V)$ is a neighborhood of x such that $f(U) \subset V$.

(4) ⇒ (1). Let V be an open set of Y; let x be a point of $f^{-1}(V)$. Then $f(x) \in V$,
so that by hypothesis there is a neighborhood $U_x$ of x such that $f(U_x) \subset V$. Then
$U_x \subset f^{-1}(V)$. It follows that $f^{-1}(V)$ can be written as the union of the open sets $U_x$,
so that it is open. ∎
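
The equivalence of the four conditions can be spot-checked on small finite spaces by enumerating every function. The Python sketch below is a sanity check under stated assumptions, not a proof: the helper names (`closure`, `cond1` through `cond4`) and the particular three-point and two-point spaces are hypothetical choices made for illustration.

```python
from itertools import chain, combinations, product

def subsets(S):
    """All subsets of S, as frozensets."""
    S = list(S)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(S, r) for r in range(len(S) + 1))]

def preimage(f, B, X):
    return frozenset(x for x in X if f[x] in B)

def image(f, A):
    return frozenset(f[x] for x in A)

def closure(A, X, T):
    """Intersection of all closed sets (complements of open sets) containing A."""
    result = frozenset(X)
    for U in T:
        C = frozenset(X) - U
        if A <= C:
            result &= C
    return result

def cond1(f, X, TX, Y, TY):   # inverse images of open sets are open
    return all(preimage(f, V, X) in TX for V in TY)

def cond2(f, X, TX, Y, TY):   # f(closure of A) lies in closure of f(A)
    return all(image(f, closure(A, X, TX)) <= closure(image(f, A), Y, TY)
               for A in subsets(X))

def cond3(f, X, TX, Y, TY):   # inverse images of closed sets are closed
    return all(frozenset(X) - preimage(f, frozenset(Y) - V, X) in TX for V in TY)

def cond4(f, X, TX, Y, TY):   # continuity at each point
    return all(any(x in U and image(f, U) <= V for U in TX)
               for x in X for V in TY if f[x] in V)

# Hypothetical spaces chosen for the check.
X = [0, 1, 2]
TX = {frozenset(), frozenset({0}), frozenset({0, 1}), frozenset({0, 1, 2})}
Y = ["p", "q"]
TY = {frozenset(), frozenset({"p"}), frozenset({"p", "q"})}

results = []
for values in product(Y, repeat=len(X)):
    f = dict(zip(X, values))
    results.append(tuple(c(f, X, TX, Y, TY) for c in (cond1, cond2, cond3, cond4)))

all_agree = all(len(set(r)) == 1 for r in results)
num_continuous = sum(r[0] for r in results)
```

For each of the eight functions from X to Y the four tests return the same answer, and exactly four of the functions are continuous, one for each open set of X serving as the inverse image of {p}.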


Homeomorphisms 


Let X and Y be topological spaces; let $f : X \to Y$ be a bijection. If both the function f
and the inverse function


$$f^{-1} : Y \to X$$


are continuous, then f is called a homeomorphism.

The condition that $f^{-1}$ be continuous says that for each open set U of X, the
inverse image of U under the map $f^{-1} : Y \to X$ is open in Y. But the inverse
image of U under the map $f^{-1}$ is the same as the image of U under the map f. See
Figure 18.1. So another way to define a homeomorphism is to say that it is a bijective
correspondence $f : X \to Y$ such that f(U) is open if and only if U is open.


Figure 18.1 


This remark shows that a homeomorphism $f : X \to Y$ gives us a bijective correspondence
not only between X and Y but between the collections of open sets of X
and of Y. As a result, any property of X that is entirely expressed in terms of the topology
of X (that is, in terms of the open sets of X) yields, via the correspondence f, the
corresponding property for the space Y. Such a property of X is called a topological
property of X.

You may have studied in modern algebra the notion of an isomorphism between algebraic
objects such as groups or rings. An isomorphism is a bijective correspondence
that preserves the algebraic structure involved. The analogous concept in topology is 
that of homeomorphism; it is a bijective correspondence that preserves the topological 
structure involved. 
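
For finite spaces, both descriptions of a homeomorphism given above can be tested mechanically. The Python sketch below is an illustration under assumptions: the function names and the example topologies on the set {0, 1} are hypothetical, and functions are stored as dictionaries.

```python
def preimage(f, V, domain):
    return frozenset(x for x in domain if f[x] in V)

def is_continuous(f, X, TX, TY):
    return all(preimage(f, V, X) in TX for V in TY)

def is_homeomorphism(f, X, TX, Y, TY):
    """A bijection that is continuous and has a continuous inverse."""
    if len(set(f.values())) != len(X) or set(f.values()) != set(Y):
        return False
    inverse = {y: x for x, y in f.items()}
    return is_continuous(f, X, TX, TY) and is_continuous(inverse, Y, TY, TX)

def carries_open_sets_onto_open_sets(f, TX, TY):
    """For a bijection f: f(U) is open if and only if U is open."""
    return {frozenset(f[x] for x in U) for U in TX} == set(TY)

# Hypothetical topologies on the two-point set {0, 1}.
X = {0, 1}
discrete = {frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})}
indiscrete = {frozenset(), frozenset({0, 1})}
opens_a = {frozenset(), frozenset({0}), frozenset({0, 1})}
opens_b = {frozenset(), frozenset({1}), frozenset({0, 1})}

identity = {0: 0, 1: 1}
swap = {0: 1, 1: 0}

identity_continuous = is_continuous(identity, X, discrete, indiscrete)
identity_homeo = is_homeomorphism(identity, X, discrete, X, indiscrete)
swap_homeo = is_homeomorphism(swap, X, opens_a, X, opens_b)
swap_correspondence = carries_open_sets_onto_open_sets(swap, opens_a, opens_b)
identity_correspondence = carries_open_sets_onto_open_sets(identity, discrete, indiscrete)
```

The identity map from the discrete to the indiscrete topology is a continuous bijection that is not a homeomorphism, while the swap map is a homeomorphism between the two other topologies; in both cases the open-set correspondence test gives the same verdict as the two-sided continuity test.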

Now suppose that $f : X \to Y$ is an injective continuous map, where X and Y
are topological spaces. Let Z be the image set f(X), considered as a subspace of Y;
then the function $f' : X \to Z$ obtained by restricting the range of f is bijective. If $f'$
happens to be a homeomorphism of X with Z, we say that the map $f : X \to Y$ is a
topological imbedding, or simply an imbedding, of X in Y.

EXAMPLE 4. The function $f : \mathbb{R} \to \mathbb{R}$ given by f(x) = 3x + 1 is a homeomorphism.
See Figure 18.2. If we define $g : \mathbb{R} \to \mathbb{R}$ by the equation


$$g(y) = \tfrac{1}{3}(y - 1),$$


then one can check easily that f(g(y)) = y and g(f(x)) = x for all real numbers x and y.
It follows that f is bijective and that $g = f^{-1}$; the continuity of f and g is a familiar result
from calculus.


EXAMPLE 5. The function $F : (-1, 1) \to \mathbb{R}$ defined by

$$F(x) = \frac{x}{1 - x^2}$$

is a homeomorphism. See Figure 18.3. We have already noted in Example 9 of §3 that F
is a bijective order-preserving correspondence; its inverse is the function G defined by


$$G(y) = \frac{2y}{1 + (1 + 4y^2)^{1/2}}.$$


The fact that F is a homeomorphism can be proved in two ways. One way is to note that
because F is order preserving and bijective, F carries a basis element for the order topology
in (−1, 1) onto a basis element for the order topology in R and vice versa. As a result, F is
automatically a homeomorphism of (−1, 1) with R (both in the order topology). Since the
order topology on (−1, 1) and the usual (subspace) topology agree, F is a homeomorphism
of (−1, 1) with R.


f(x) = 3x+1 


Figure 18.2 Figure 18.3 


A second way to show F a homeomorphism is to use the continuity of the algebraic
functions and the square-root function to show that both F and G are continuous. These
are familiar facts from calculus.


EXAMPLE 6. A bijective function $f : X \to Y$ can be continuous without being a homeomorphism.
One such function is the identity map $g : \mathbb{R}_\ell \to \mathbb{R}$ considered in Example 3.
Another is the following: Let $S^1$ denote the unit circle,


$$S^1 = \{x \times y \mid x^2 + y^2 = 1\},$$

considered as a subspace of the plane $\mathbb{R}^2$, and let

$$f : [0, 1) \to S^1$$


be the map defined by $f(t) = (\cos 2\pi t, \sin 2\pi t)$. The fact that f is bijective and continuous
follows from familiar properties of the trigonometric functions. But the function $f^{-1}$
is not continuous. The image under f of the open set $U = [0, \tfrac{1}{4})$ of the domain, for instance,
is not open in $S^1$, for the point p = f(0) lies in no open set V of $\mathbb{R}^2$ such that
$V \cap S^1 \subset f(U)$. See Figure 18.4.


Figure 18.4


EXAMPLE 7. Consider the function

$$g : [0, 1) \to \mathbb{R}^2$$


obtained from the function f of the preceding example by expanding the range. The map g
is an example of a continuous injective map that is not an imbedding.


Constructing Continuous Functions 


How does one go about constructing continuous functions from one topological space 
to another? There are a number of methods used in analysis, of which some generalize 
to arbitrary topological spaces and others do not. We study first some constructions 
that do hold for general topological spaces, deferring consideration of the others until 
later. 


Theorem 18.2 (Rules for constructing continuous functions). Let X, Y, and Z be
topological spaces.
(a) (Constant function) If $f : X \to Y$ maps all of X into the single point $y_0$ of Y,
then f is continuous.
(b) (Inclusion) If A is a subspace of X, the inclusion function $j : A \to X$ is continuous.
(c) (Composites) If $f : X \to Y$ and $g : Y \to Z$ are continuous, then the map
$g \circ f : X \to Z$ is continuous.
(d) (Restricting the domain) If $f : X \to Y$ is continuous, and if A is a subspace
of X, then the restricted function $f|A : A \to Y$ is continuous.
(e) (Restricting or expanding the range) Let $f : X \to Y$ be continuous. If Z is a
subspace of Y containing the image set f(X), then the function $g : X \to Z$
obtained by restricting the range of f is continuous. If Z is a space having Y as
a subspace, then the function $h : X \to Z$ obtained by expanding the range of f
is continuous.
(f) (Local formulation of continuity) The map $f : X \to Y$ is continuous if X can be
written as the union of open sets $U_\alpha$ such that $f|U_\alpha$ is continuous for each α.


Proof. (a) Let $f(x) = y_0$ for every x in X. Let V be open in Y. The set $f^{-1}(V)$
equals X or ∅, depending on whether V contains $y_0$ or not. In either case, it is open.

(b) If U is open in X, then $j^{-1}(U) = U \cap A$, which is open in A by definition of
the subspace topology.

(c) If U is open in Z, then $g^{-1}(U)$ is open in Y and $f^{-1}(g^{-1}(U))$ is open in X.
But


$$f^{-1}(g^{-1}(U)) = (g \circ f)^{-1}(U),$$


by elementary set theory.

(d) The function $f|A$ equals the composite of the inclusion map $j : A \to X$ and
the map $f : X \to Y$, both of which are continuous.

(e) Let $f : X \to Y$ be continuous. If $f(X) \subset Z \subset Y$, we show that the function
$g : X \to Z$ obtained from f is continuous. Let B be open in Z. Then $B = Z \cap U$ for
some open set U of Y. Because Z contains the entire image set f(X),


$$f^{-1}(U) = g^{-1}(B),$$


by elementary set theory. Since $f^{-1}(U)$ is open, so is $g^{-1}(B)$.

To show $h : X \to Z$ is continuous if Z has Y as a subspace, note that h is the
composite of the map $f : X \to Y$ and the inclusion map $j : Y \to Z$.

(f) By hypothesis, we can write X as a union of open sets $U_\alpha$ such that $f|U_\alpha$ is
continuous for each α. Let V be an open set in Y. Then


$$f^{-1}(V) \cap U_\alpha = (f|U_\alpha)^{-1}(V),$$


because both expressions represent the set of those points x lying in $U_\alpha$ for which
$f(x) \in V$. Since $f|U_\alpha$ is continuous, this set is open in $U_\alpha$, and hence open in X. But


$$f^{-1}(V) = \bigcup_\alpha \left(f^{-1}(V) \cap U_\alpha\right),$$


so that $f^{-1}(V)$ is also open in X. ∎


Theorem 18.3 (The pasting lemma). Let $X = A \cup B$, where A and B are closed
in X. Let $f : A \to Y$ and $g : B \to Y$ be continuous. If f(x) = g(x) for every
$x \in A \cap B$, then f and g combine to give a continuous function $h : X \to Y$, defined
by setting h(x) = f(x) if $x \in A$, and h(x) = g(x) if $x \in B$.

Proof. Let C be a closed subset of Y. Now

$$h^{-1}(C) = f^{-1}(C) \cup g^{-1}(C),$$


by elementary set theory. Since f is continuous, $f^{-1}(C)$ is closed in A and, therefore,
closed in X. Similarly, $g^{-1}(C)$ is closed in B and therefore closed in X. Their union
$h^{-1}(C)$ is thus closed in X. ∎


This theorem also holds if A and B are open sets in X; this is just a special case of
the “local formulation of continuity” rule given in the preceding theorem.


EXAMPLE 8. Let us define a function $h : \mathbb{R} \to \mathbb{R}$ by setting


$$h(x) = \begin{cases} x & \text{for } x \le 0, \\ x/2 & \text{for } x \ge 0. \end{cases}$$


Each of the “pieces” of this definition is a continuous function, and they agree on the
overlapping part of their domains, which is the one-point set {0}. Since their domains are
closed in R, the function h is continuous. One needs the “pieces” of the function to agree
on the overlapping part of their domains in order to have a function at all. The equations


$$k(x) = \begin{cases} x - 2 & \text{for } x \le 0, \\ x + 2 & \text{for } x \ge 0, \end{cases}$$


for instance, do not define a function. On the other hand, one needs some limitations on
the sets A and B to guarantee continuity. The equations


$$l(x) = \begin{cases} x - 2 & \text{for } x < 0, \\ x + 2 & \text{for } x \ge 0, \end{cases}$$


for instance, do define a function l mapping R into R, and both of the pieces are continuous.
But l is not continuous; the inverse image of the open set (1, 3), for instance, is the nonopen
set [0, 1). See Figure 18.5.
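
The two functions of this example can be evaluated numerically to see the difference. The short Python sketch below is only an illustration: the name `ell` stands in for the function l, the grid of sample points k/8 is an arbitrary assumption, and a finite sample can suggest but not prove the shape of an inverse image.

```python
from fractions import Fraction

def h(x):
    """Pieces x (for x <= 0) and x/2 (for x >= 0); they agree at the overlap {0}."""
    return x if x <= 0 else x / 2

def ell(x):
    """Pieces x - 2 (for x < 0) and x + 2 (for x >= 0); stands in for the function l."""
    return x - 2 if x < 0 else x + 2

# Exact rational sample points k/8 in [-4, 4]; the grid is an arbitrary choice.
samples = [Fraction(k, 8) for k in range(-32, 33)]

# Sampled inverse image of the open interval (1, 3) under ell.
in_preimage = [x for x in samples if 1 < ell(x) < 3]

# Change in each function between 0 and one grid step to the left of 0.
step = Fraction(1, 8)
jump_h = h(Fraction(0)) - h(-step)
jump_ell = ell(Fraction(0)) - ell(-step)
```

The sampled inverse image consists of the grid points from 0 up to 7/8, which matches the interval [0, 1) with its left endpoint included. Across 0 the value of `h` changes by exactly the step size, while `ell` changes by more than 4.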


Figure 18.5


Theorem 18.4 (Maps into products). Let $f : A \to X \times Y$ be given by the equation

$$f(a) = (f_1(a), f_2(a)).$$

Then f is continuous if and only if the functions

$$f_1 : A \to X \quad \text{and} \quad f_2 : A \to Y$$


are continuous.


The maps $f_1$ and $f_2$ are called the coordinate functions of f.

Proof. Let $\pi_1 : X \times Y \to X$ and $\pi_2 : X \times Y \to Y$ be projections onto the first and
second factors, respectively. These maps are continuous. For $\pi_1^{-1}(U) = U \times Y$ and
$\pi_2^{-1}(V) = X \times V$, and these sets are open if U and V are open. Note that for each
$a \in A$,


$$f_1(a) = \pi_1(f(a)) \quad \text{and} \quad f_2(a) = \pi_2(f(a)).$$


If the function f is continuous, then $f_1$ and $f_2$ are composites of continuous functions
and therefore continuous. Conversely, suppose that $f_1$ and $f_2$ are continuous. We
show that for each basis element U × V for the topology of X × Y, its inverse image
$f^{-1}(U \times V)$ is open. A point a is in $f^{-1}(U \times V)$ if and only if $f(a) \in U \times V$, that
is, if and only if $f_1(a) \in U$ and $f_2(a) \in V$. Therefore,


$$f^{-1}(U \times V) = f_1^{-1}(U) \cap f_2^{-1}(V).$$

Since both of the sets $f_1^{-1}(U)$ and $f_2^{-1}(V)$ are open, so is their intersection. ∎
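
Both the set identity used in the proof and the statement of the theorem can be checked exhaustively on small finite spaces. The Python sketch below is a sanity check under assumptions, not a proof: the spaces A, X, and Y and all helper names are hypothetical, and continuity of the map into X × Y is tested on basis elements U × V only, which suffices by the remark on bases made earlier in this section.

```python
from itertools import product

def preimage(f, V, domain):
    return frozenset(a for a in domain if f[a] in V)

def is_continuous(f, domain, T_domain, open_sets):
    """Inverse image of each listed open (or basis) set is open in the domain."""
    return all(preimage(f, V, domain) in T_domain for V in open_sets)

# Hypothetical finite spaces.
A = [0, 1, 2]
TA = {frozenset(), frozenset({0}), frozenset({0, 1}), frozenset({0, 1, 2})}
X = ["x0", "x1"]
TX = {frozenset(), frozenset({"x0"}), frozenset({"x0", "x1"})}
Y = ["y0", "y1"]
TY = {frozenset(), frozenset({"y1"}), frozenset({"y0", "y1"})}

# Basis for the product topology on X x Y: all products U x V of open sets.
basis = [frozenset(product(U, V)) for U in TX for V in TY]

identity_holds = True
theorem_holds = True
for v1 in product(X, repeat=len(A)):
    for v2 in product(Y, repeat=len(A)):
        f1 = dict(zip(A, v1))
        f2 = dict(zip(A, v2))
        f = {a: (f1[a], f2[a]) for a in A}
        for U in TX:
            for V in TY:
                lhs = preimage(f, frozenset(product(U, V)), A)
                rhs = preimage(f1, U, A) & preimage(f2, V, A)
                identity_holds = identity_holds and lhs == rhs
        both = is_continuous(f1, A, TA, TX) and is_continuous(f2, A, TA, TY)
        theorem_holds = theorem_holds and (is_continuous(f, A, TA, basis) == both)
```

Over all 64 pairs of coordinate functions, the inverse image of U × V always equals the intersection of the two coordinate inverse images, and f passes the continuity test exactly when both coordinate functions do.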


There is no useful criterion for the continuity of a map $f : A \times B \to X$ whose
domain is a product space. One might conjecture that f is continuous if it is continuous
“in each variable separately,” but this conjecture is not true. (See Exercise 12.)


EXAMPLE 9. In calculus, a parametrized curve in the plane is defined to be a continuous
map $f : [a, b] \to \mathbb{R}^2$. It is often expressed in the form f(t) = (x(t), y(t)); and one
frequently uses the fact that f is a continuous function of t if both x and y are. Similarly,
a vector field in the plane


$$\mathbf{v}(x, y) = P(x, y)\,\mathbf{i} + Q(x, y)\,\mathbf{j} = (P(x, y), Q(x, y))$$


is said to be continuous if both P and Q are continuous functions, or equivalently, if v is
continuous as a map of $\mathbb{R}^2$ into $\mathbb{R}^2$. Both of these statements are simply special cases of
the preceding theorem.


One way of forming continuous functions that is used a great deal in analysis is to
take sums, differences, products, or quotients of continuous real-valued functions. It
is a standard theorem that if $f, g : X \to \mathbb{R}$ are continuous, then f + g, f − g, and
f · g are continuous, and f/g is continuous if $g(x) \neq 0$ for all x. We shall consider
this theorem in §21.

Yet another method for constructing continuous functions that is familiar from
analysis is to take the limit of an infinite sequence of functions. There is a theorem to
the effect that if a sequence of continuous real-valued functions of a real variable converges
uniformly to a limit function, then the limit function is necessarily continuous.
This theorem is called the Uniform Limit Theorem. It is used, for instance, to demonstrate
the continuity of the trigonometric functions, when one defines these functions
rigorously using the infinite series definitions of the sine and cosine. This theorem
generalizes to a theorem about maps of an arbitrary topological space X into a metric
space Y. We shall prove it in §21.


Exercises 


1. Prove that for functions $f : \mathbb{R} \to \mathbb{R}$, the ε-δ definition of continuity implies the
open set definition.

2. Suppose that $f : X \to Y$ is continuous. If x is a limit point of the subset A of X,
is it necessarily true that f(x) is a limit point of f(A)?

3. Let X and X′ denote a single set in the two topologies $\mathcal{T}$ and $\mathcal{T}'$, respectively.
Let $i : X' \to X$ be the identity function.
(a) Show that i is continuous ⇔ $\mathcal{T}'$ is finer than $\mathcal{T}$.
(b) Show that i is a homeomorphism ⇔ $\mathcal{T}' = \mathcal{T}$.

4. Given $x_0 \in X$ and $y_0 \in Y$, show that the maps $f : X \to X \times Y$ and $g : Y \to
X \times Y$ defined by


$$f(x) = x \times y_0 \quad \text{and} \quad g(y) = x_0 \times y$$


are imbeddings.

5. Show that the subspace (a, b) of R is homeomorphic with (0, 1) and the subspace
[a, b] of R is homeomorphic with [0, 1].

6. Find a function $f : \mathbb{R} \to \mathbb{R}$ that is continuous at precisely one point.

7. (a) Suppose that $f : \mathbb{R} \to \mathbb{R}$ is “continuous from the right,” that is,


$$\lim_{x \to a^+} f(x) = f(a),$$


for each $a \in \mathbb{R}$. Show that f is continuous when considered as a function
from $\mathbb{R}_\ell$ to $\mathbb{R}$.

(b) Can you conjecture what functions $f : \mathbb{R} \to \mathbb{R}$ are continuous when considered
as maps from $\mathbb{R}$ to $\mathbb{R}_\ell$? As maps from $\mathbb{R}_\ell$ to $\mathbb{R}_\ell$? We shall return to
this question in Chapter 3.


8. Let Y be an ordered set in the order topology. Let $f, g : X \to Y$ be continuous.
(a) Show that the set $\{x \mid f(x) \le g(x)\}$ is closed in X.
(b) Let $h : X \to Y$ be the function

$$h(x) = \min\{f(x), g(x)\}.$$

Show that h is continuous. [Hint: Use the pasting lemma.]

9. Let $\{A_\alpha\}$ be a collection of subsets of X; let $X = \bigcup_\alpha A_\alpha$. Let $f : X \to Y$;
suppose that $f|A_\alpha$ is continuous for each α.
(a) Show that if the collection $\{A_\alpha\}$ is finite and each set $A_\alpha$ is closed, then f is
continuous.
(b) Find an example where the collection $\{A_\alpha\}$ is countable and each $A_\alpha$ is
closed, but f is not continuous.
(c) An indexed family of sets $\{A_\alpha\}$ is said to be locally finite if each point x
of X has a neighborhood that intersects $A_\alpha$ for only finitely many values of
α. Show that if the family $\{A_\alpha\}$ is locally finite and each $A_\alpha$ is closed, then
f is continuous.
10. Let $f : A \to B$ and $g : C \to D$ be continuous functions. Let us define a map
$f \times g : A \times C \to B \times D$ by the equation


$$(f \times g)(a \times c) = f(a) \times g(c).$$


Show that f × g is continuous.

11. Let $F : X \times Y \to Z$. We say that F is continuous in each variable separately if
for each $y_0$ in Y, the map $h : X \to Z$ defined by $h(x) = F(x \times y_0)$ is continuous,
and for each $x_0$ in X, the map $k : Y \to Z$ defined by $k(y) = F(x_0 \times y)$ is
continuous. Show that if F is continuous, then F is continuous in each variable
separately.


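12. Let $F : \mathbb{R} \times \mathbb{R} \to \mathbb{R}$ be defined by the equation


$$F(x \times y) = \begin{cases} xy/(x^2 + y^2) & \text{if } x \times y \neq 0 \times 0, \\ 0 & \text{if } x \times y = 0 \times 0. \end{cases}$$

(a) Show that F is continuous in each variable separately.
(b) Compute the function $g : \mathbb{R} \to \mathbb{R}$ defined by $g(x) = F(x \times x)$.
(c) Show that F is not continuous.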

13. Let $A \subset X$; let $f : A \to Y$ be continuous; let Y be Hausdorff. Show that
if f may be extended to a continuous function $g : \bar{A} \to Y$, then g is uniquely
determined by f.


§19 The Product Topology


We now return, for the remainder of the chapter, to the consideration of various methods
for imposing topologies on sets.

Previously, we defined a topology on the product X × Y of two topological spaces.
In the present section, we generalize this definition to more general cartesian products.
So let us consider the cartesian products


$$X_1 \times \cdots \times X_n \quad \text{and} \quad X_1 \times X_2 \times \cdots,$$


where each $X_i$ is a topological space. There are two possible ways to proceed. One
way is to take as basis all sets of the form $U_1 \times \cdots \times U_n$ in the first case, and of the
form $U_1 \times U_2 \times \cdots$ in the second case, where $U_i$ is an open set of $X_i$ for each i. This
procedure does indeed define a topology on the cartesian product; we shall call it the
box topology.

Another way to proceed is to generalize the subbasis formulation of the definition,
given in §15. In this case, we take as a subbasis all sets of the form $\pi_i^{-1}(U_i)$, where i is
any index and $U_i$ is an open set of $X_i$. We shall call this topology the product topology.

How do these topologies differ? Consider the typical basis element B for the
second topology. It is a finite intersection of subbasis elements $\pi_i^{-1}(U_i)$, say for
$i = i_1, \ldots, i_k$. Then a point x belongs to B if and only if $\pi_i(x)$ belongs to $U_i$ for
$i = i_1, \ldots, i_k$; there is no restriction on $\pi_i(x)$ for other values of i.

It follows that these two topologies agree for the finite cartesian product and differ
for the infinite product. What is not clear is why we seem to prefer the second topology.
This is the question we shall explore in this section.

Before proceeding, however, we shall introduce a more general notion of cartesian
product. So far, we have defined the cartesian product of an indexed family of sets
only in the cases where the index set was the set $\{1, \ldots, n\}$ or the set $\mathbb{Z}_+$. Now we
consider the case where the index set is completely arbitrary.


Definition. Let J be an index set. Given a set X, we define a J-tuple of elements
of X to be a function $\mathbf{x} : J \to X$. If α is an element of J, we often denote the value
of $\mathbf{x}$ at α by $x_\alpha$ rather than $\mathbf{x}(\alpha)$; we call it the αth coordinate of $\mathbf{x}$. And we often
denote the function $\mathbf{x}$ itself by the symbol


$$(x_\alpha)_{\alpha \in J},$$


which is as close as we can come to a “tuple notation” for an arbitrary index set J. We
denote the set of all J-tuples of elements of X by $X^J$.


Definition. Let $\{A_\alpha\}_{\alpha \in J}$ be an indexed family of sets; let $X = \bigcup_{\alpha \in J} A_\alpha$. The
cartesian product of this indexed family, denoted by

$$\prod_{\alpha \in J} A_\alpha,$$


is defined to be the set of all J-tuples $(x_\alpha)_{\alpha \in J}$ of elements of X such that $x_\alpha \in A_\alpha$ for
each $\alpha \in J$. That is, it is the set of all functions


$$\mathbf{x} : J \to \bigcup_{\alpha \in J} A_\alpha$$

such that $\mathbf{x}(\alpha) \in A_\alpha$ for each $\alpha \in J$.

Occasionally we denote the product simply by $\prod A_\alpha$, and its general element
by $(x_\alpha)$, if the index set is understood.

If all the sets $A_\alpha$ are equal to one set X, then the cartesian product $\prod_{\alpha \in J} A_\alpha$ is just
the set $X^J$ of all J-tuples of elements of X. We sometimes use “tuple notation” for
the elements of $X^J$, and sometimes we use functional notation, depending on which is
more convenient.


Definition. Let $\{X_\alpha\}_{\alpha \in J}$ be an indexed family of topological spaces. Let us take as
a basis for a topology on the product space


$$\prod_{\alpha \in J} X_\alpha$$


the collection of all sets of the form


$$\prod_{\alpha \in J} U_\alpha,$$


where $U_\alpha$ is open in $X_\alpha$, for each $\alpha \in J$. The topology generated by this basis is called
the box topology.


This collection satisfies the first condition for a basis because $\prod X_\alpha$ is itself a basis
element; and it satisfies the second condition because the intersection of any two basis
elements is another basis element:


$$\Big(\prod_{\alpha \in J} U_\alpha\Big) \cap \Big(\prod_{\alpha \in J} V_\alpha\Big) = \prod_{\alpha \in J} (U_\alpha \cap V_\alpha).$$


Now we generalize the subbasis formulation of the definition. Let


$$\pi_\beta : \prod_{\alpha \in J} X_\alpha \to X_\beta$$


be the function assigning to each element of the product space its βth coordinate,

$$\pi_\beta((x_\alpha)_{\alpha \in J}) = x_\beta;$$


it is called the projection mapping associated with the index β.


Definition. Let $\mathcal{S}_\beta$ denote the collection

$$\mathcal{S}_\beta = \{\pi_\beta^{-1}(U_\beta) \mid U_\beta \text{ open in } X_\beta\},$$


and let $\mathcal{S}$ denote the union of these collections,

$$\mathcal{S} = \bigcup_{\beta \in J} \mathcal{S}_\beta.$$


The topology generated by the subbasis $\mathcal{S}$ is called the product topology. In this topology
$\prod_{\alpha \in J} X_\alpha$ is called a product space.

To compare these topologies, we consider the basis $\mathcal{B}$ that $\mathcal{S}$ generates. The collection
$\mathcal{B}$ consists of all finite intersections of elements of $\mathcal{S}$. If we intersect elements
belonging to the same one of the sets $\mathcal{S}_\beta$, we do not get anything new, because


$$\pi_\beta^{-1}(U_\beta) \cap \pi_\beta^{-1}(V_\beta) = \pi_\beta^{-1}(U_\beta \cap V_\beta);$$


the intersection of two elements of $\mathcal{S}_\beta$, or of finitely many such elements, is again an
element of $\mathcal{S}_\beta$. We get something new only when we intersect elements from different
sets $\mathcal{S}_\beta$. The typical element of the basis $\mathcal{B}$ can thus be described as follows: Let $\beta_1$,
..., $\beta_n$ be a finite set of distinct indices from the index set J, and let $U_{\beta_i}$ be an open
set in $X_{\beta_i}$ for i = 1, ..., n. Then


$$B = \pi_{\beta_1}^{-1}(U_{\beta_1}) \cap \pi_{\beta_2}^{-1}(U_{\beta_2}) \cap \cdots \cap \pi_{\beta_n}^{-1}(U_{\beta_n})$$


is the typical element of $\mathcal{B}$.

Now a point $\mathbf{x} = (x_\alpha)$ is in B if and only if its $\beta_1$th coordinate is in $U_{\beta_1}$, its $\beta_2$th
coordinate is in $U_{\beta_2}$, and so on. There is no restriction whatever on the αth coordinate
of $\mathbf{x}$ if α is not one of the indices $\beta_1, \ldots, \beta_n$. As a result, we can write B as the product

$$B = \prod_{\alpha \in J} U_\alpha,$$

where $U_\alpha$ denotes the entire space $X_\alpha$ if $\alpha \neq \beta_1, \ldots, \beta_n$.


All this is summarized in the following theorem: 


Theorem 19.1 (Comparison of the box and product topologies). The box topology
on $\prod X_\alpha$ has as basis all sets of the form $\prod U_\alpha$, where $U_\alpha$ is open in $X_\alpha$ for
each $\alpha$. The product topology on $\prod X_\alpha$ has as basis all sets of the form $\prod U_\alpha$, where
$U_\alpha$ is open in $X_\alpha$ for each $\alpha$ and $U_\alpha$ equals $X_\alpha$ except for finitely many values of $\alpha$.


Two things are immediately clear. First, for finite products $\prod_{\alpha=1}^{n} X_\alpha$ the two
topologies are precisely the same. Second, the box topology is in general finer than
the product topology.

What is not so clear is why we prefer the product topology to the box topology. The 
answer will appear as we continue our study of topology. We shall find that a number 
of important theorems about finite products will also hold for arbitrary products if we 
use the product topology, but not if we use the box topology. As a result, the product 
topology is extremely important in mathematics. The box topology is not so important; 
we shall use it primarily for constructing counterexamples. Therefore, we make the
following convention: 


Whenever we consider the product $\prod X_\alpha$, we shall assume it is given the
product topology unless we specifically state otherwise. 


Some of the theorems we proved for the product $X \times Y$ hold for the product $\prod X_\alpha$
no matter which topology we use. We list them here; most of the proofs are left to the 
exercises. 
Theorem 19.2. Suppose the topology on each space $X_\alpha$ is given by a basis $\mathcal{B}_\alpha$. The
collection of all sets of the form

$$\prod_{\alpha \in J} B_\alpha,$$

where $B_\alpha \in \mathcal{B}_\alpha$ for each $\alpha$, will serve as a basis for the box topology on $\prod_{\alpha \in J} X_\alpha$.

The collection of all sets of the same form, where $B_\alpha \in \mathcal{B}_\alpha$ for finitely many
indices $\alpha$ and $B_\alpha = X_\alpha$ for all the remaining indices, will serve as a basis for the
product topology on $\prod_{\alpha \in J} X_\alpha$.


EXAMPLE 1. Consider euclidean $n$-space $\mathbb{R}^n$. A basis for $\mathbb{R}$ consists of all open intervals
in $\mathbb{R}$; hence a basis for the topology of $\mathbb{R}^n$ consists of all products of the form

$$(a_1, b_1) \times (a_2, b_2) \times \cdots \times (a_n, b_n).$$

Since $\mathbb{R}^n$ is a finite product, the box and product topologies agree. Whenever we
consider $\mathbb{R}^n$, we will assume that it is given this topology, unless we specifically state
otherwise.


Theorem 19.3. Let $A_\alpha$ be a subspace of $X_\alpha$, for each $\alpha \in J$. Then $\prod A_\alpha$ is a
subspace of $\prod X_\alpha$ if both products are given the box topology, or if both products are
given the product topology.


Theorem 19.4. If each space $X_\alpha$ is a Hausdorff space, then $\prod X_\alpha$ is a Hausdorff
space in both the box and product topologies.


Theorem 19.5. Let $\{X_\alpha\}$ be an indexed family of spaces; let $A_\alpha \subset X_\alpha$ for each $\alpha$. If
$\prod X_\alpha$ is given either the product or the box topology, then

$$\prod \bar{A}_\alpha = \overline{\prod A_\alpha}.$$


Proof. Let $x = (x_\alpha)$ be a point of $\prod \bar{A}_\alpha$; we show that $x \in \overline{\prod A_\alpha}$. Let $U = \prod U_\alpha$ be
a basis element for either the box or product topology that contains $x$. Since $x_\alpha \in \bar{A}_\alpha$,
we can choose a point $y_\alpha \in U_\alpha \cap A_\alpha$ for each $\alpha$. Then $y = (y_\alpha)$ belongs to both $U$
and $\prod A_\alpha$. Since $U$ is arbitrary, it follows that $x$ belongs to the closure of $\prod A_\alpha$.

Conversely, suppose $x = (x_\alpha)$ lies in the closure of $\prod A_\alpha$, in either topology. We
show that for any given index $\beta$, we have $x_\beta \in \bar{A}_\beta$. Let $V_\beta$ be an arbitrary open set
of $X_\beta$ containing $x_\beta$. Since $\pi_\beta^{-1}(V_\beta)$ is open in $\prod X_\alpha$ in either topology, it contains a
point $y = (y_\alpha)$ of $\prod A_\alpha$. Then $y_\beta$ belongs to $V_\beta \cap A_\beta$. It follows that $x_\beta \in \bar{A}_\beta$. ∎


So far, no reason has appeared for preferring the product to the box topology. It is 
when we try to generalize our previous theorem about continuity of maps into product 
spaces that a difference first arises. Here is a theorem that does not hold if $\prod X_\alpha$ is
given the box topology: 
Theorem 19.6. Let $f : A \to \prod_{\alpha \in J} X_\alpha$ be given by the equation

$$f(a) = (f_\alpha(a))_{\alpha \in J},$$

where $f_\alpha : A \to X_\alpha$ for each $\alpha$. Let $\prod X_\alpha$ have the product topology. Then the
function $f$ is continuous if and only if each function $f_\alpha$ is continuous.


Proof. Let $\pi_\beta$ be the projection of the product onto its $\beta$th factor. The function $\pi_\beta$
is continuous, for if $U_\beta$ is open in $X_\beta$, the set $\pi_\beta^{-1}(U_\beta)$ is a subbasis element for the
product topology on $\prod X_\alpha$. Now suppose that $f : A \to \prod X_\alpha$ is continuous. The
function $f_\beta$ equals the composite $\pi_\beta \circ f$; being the composite of two continuous
functions, it is continuous.

Conversely, suppose that each coordinate function $f_\alpha$ is continuous. To prove
that $f$ is continuous, it suffices to prove that the inverse image under $f$ of each subbasis
element is open in $A$; we remarked on this fact when we defined continuous functions.
A typical subbasis element for the product topology on $\prod X_\alpha$ is a set of the form
$\pi_\beta^{-1}(U_\beta)$, where $\beta$ is some index and $U_\beta$ is open in $X_\beta$. Now

$$f^{-1}(\pi_\beta^{-1}(U_\beta)) = f_\beta^{-1}(U_\beta),$$

because $f_\beta = \pi_\beta \circ f$. Since $f_\beta$ is continuous, this set is open in $A$, as desired. ∎


Why does this theorem fail if we use the box topology? Probably the most con- 
vincing thing to do is to look at an example. 


EXAMPLE 2. Consider $\mathbb{R}^\omega$, the countably infinite product of $\mathbb{R}$ with itself. Recall that

$$\mathbb{R}^\omega = \prod_{n \in \mathbb{Z}_+} X_n,$$

where $X_n = \mathbb{R}$ for each $n$. Let us define a function $f : \mathbb{R} \to \mathbb{R}^\omega$ by the equation

$$f(t) = (t, t, t, \ldots);$$

the $n$th coordinate function of $f$ is the function $f_n(t) = t$. Each of the coordinate functions
$f_n : \mathbb{R} \to \mathbb{R}$ is continuous; therefore, the function $f$ is continuous if $\mathbb{R}^\omega$ is given the
product topology. But $f$ is not continuous if $\mathbb{R}^\omega$ is given the box topology. Consider, for
example, the basis element

$$B = (-1, 1) \times (-\tfrac{1}{2}, \tfrac{1}{2}) \times (-\tfrac{1}{3}, \tfrac{1}{3}) \times \cdots$$

for the box topology. We assert that $f^{-1}(B)$ is not open in $\mathbb{R}$. If $f^{-1}(B)$ were open
in $\mathbb{R}$, it would contain some interval $(-\delta, \delta)$ about the point 0. This would mean that
$f((-\delta, \delta)) \subset B$, so that, applying $\pi_n$ to both sides of the inclusion,

$$f_n((-\delta, \delta)) = (-\delta, \delta) \subset (-1/n, 1/n)$$

for all $n$, a contradiction.
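
The contradiction in Example 2 can be made concrete. The following is a minimal sketch, not part of the original text; the function name `first_failing_coordinate` is an illustrative assumption. Given any $\delta > 0$, it returns the smallest index $n$ with $1/n < \delta$, which is the first coordinate in which the interval $(-\delta, \delta)$ fails to lie inside $(-1/n, 1/n)$.

```python
from fractions import Fraction
from math import floor


def first_failing_coordinate(delta):
    """Return the smallest n >= 1 with 1/n < delta.

    For that n, the interval (-delta, delta) is not contained in
    (-1/n, 1/n), so f((-delta, delta)) is not contained in the box
    B = (-1, 1) x (-1/2, 1/2) x (-1/3, 1/3) x ...
    """
    delta = Fraction(delta)  # exact arithmetic avoids rounding issues
    if delta <= 0:
        raise ValueError("delta must be positive")
    # 1/n < delta  <=>  n > 1/delta; the smallest such integer is below.
    return floor(1 / delta) + 1
```

Since such an index exists for every positive $\delta$, no interval about 0 is carried by $f$ into $B$.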
Exercises

1. Prove Theorem 19.2.
2. Prove Theorem 19.3.
3. Prove Theorem 19.4.
4. Show that $(X_1 \times \cdots \times X_{n-1}) \times X_n$ is homeomorphic with $X_1 \times \cdots \times X_n$.
5. One of the implications stated in Theorem 19.6 holds for the box topology.
   Which one?
6. Let $\mathbf{x}_1, \mathbf{x}_2, \ldots$ be a sequence of the points of the product space $\prod X_\alpha$. Show that
   this sequence converges to the point $\mathbf{x}$ if and only if the sequence $\pi_\alpha(\mathbf{x}_1), \pi_\alpha(\mathbf{x}_2),
   \ldots$ converges to $\pi_\alpha(\mathbf{x})$ for each $\alpha$. Is this fact true if one uses the box topology
   instead of the product topology?
7. Let $\mathbb{R}^\infty$ be the subset of $\mathbb{R}^\omega$ consisting of all sequences that are "eventually zero,"
   that is, all sequences $(x_1, x_2, \ldots)$ such that $x_i \neq 0$ for only finitely many values
   of $i$. What is the closure of $\mathbb{R}^\infty$ in $\mathbb{R}^\omega$ in the box and product topologies? Justify
   your answer.
8. Given sequences $(a_1, a_2, \ldots)$ and $(b_1, b_2, \ldots)$ of real numbers with $a_i > 0$ for
   all $i$, define $h : \mathbb{R}^\omega \to \mathbb{R}^\omega$ by the equation

   $$h((x_1, x_2, \ldots)) = (a_1 x_1 + b_1, a_2 x_2 + b_2, \ldots).$$

   Show that if $\mathbb{R}^\omega$ is given the product topology, $h$ is a homeomorphism of $\mathbb{R}^\omega$ with
   itself. What happens if $\mathbb{R}^\omega$ is given the box topology?
9. Show that the choice axiom is equivalent to the statement that for any indexed
   family $\{A_\alpha\}_{\alpha \in J}$ of nonempty sets, with $J \neq \varnothing$, the cartesian product

   $$\prod_{\alpha \in J} A_\alpha$$

   is not empty.
10. Let $A$ be a set; let $\{X_\alpha\}_{\alpha \in J}$ be an indexed family of spaces; and let $\{f_\alpha\}_{\alpha \in J}$ be
    an indexed family of functions $f_\alpha : A \to X_\alpha$.
    (a) Show there is a unique coarsest topology $\mathcal{T}$ on $A$ relative to which each of
        the functions $f_\alpha$ is continuous.
    (b) Let

        $$\mathcal{S}_\beta = \{f_\beta^{-1}(U_\beta) \mid U_\beta \text{ is open in } X_\beta\},$$

        and let $\mathcal{S} = \bigcup \mathcal{S}_\beta$. Show that $\mathcal{S}$ is a subbasis for $\mathcal{T}$.
    (c) Show that a map $g : Y \to A$ is continuous relative to $\mathcal{T}$ if and only if each
        map $f_\alpha \circ g$ is continuous.
    (d) Let $f : A \to \prod X_\alpha$ be defined by the equation

        $$f(a) = (f_\alpha(a))_{\alpha \in J};$$

        let $Z$ denote the subspace $f(A)$ of the product space $\prod X_\alpha$. Show that the
        image under $f$ of each element of $\mathcal{T}$ is an open set of $Z$.

§20 The Metric Topology


One of the most important and frequently used ways of imposing a topology on a set is
to define the topology in terms of a metric on the set. Topologies given in this way lie
at the heart of modern analysis, for example. In this section, we shall define the metric
topology and shall give a number of examples. In the next section, we shall consider
some of the properties that metric topologies satisfy.


Definition. A metric on a set $X$ is a function

$$d : X \times X \to \mathbb{R}$$

having the following properties:
(1) $d(x, y) \geq 0$ for all $x, y \in X$; equality holds if and only if $x = y$.
(2) $d(x, y) = d(y, x)$ for all $x, y \in X$.
(3) (Triangle inequality) $d(x, y) + d(y, z) \geq d(x, z)$, for all $x, y, z \in X$.
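
The three axioms can be spot-checked mechanically on a finite sample of points. The sketch below is an illustration only and is not part of the original text; the helper name `is_metric_on_sample` and the tolerance parameter are assumptions. Passing the check on a sample does not prove that a function is a metric, but failing it exhibits a counterexample.

```python
from itertools import product


def is_metric_on_sample(d, points, tol=1e-12):
    """Check the three metric axioms for d on every pair and triple
    drawn from the finite list `points`."""
    for x, y in product(points, repeat=2):
        # (1) nonnegativity, with equality exactly when x == y
        if d(x, y) < 0 or (d(x, y) == 0) != (x == y):
            return False
        # (2) symmetry
        if d(x, y) != d(y, x):
            return False
    for x, y, z in product(points, repeat=3):
        # (3) triangle inequality, allowing for floating-point rounding
        if d(x, z) > d(x, y) + d(y, z) + tol:
            return False
    return True
```

For instance, $|x - y|$ passes on any sample of real numbers, while $(x - y)^2$ fails the triangle inequality on the points 0, 1, 2.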


Given a metric $d$ on $X$, the number $d(x, y)$ is often called the distance between $x$
and $y$ in the metric $d$. Given $\epsilon > 0$, consider the set

$$B_d(x, \epsilon) = \{y \mid d(x, y) < \epsilon\}$$

of all points $y$ whose distance from $x$ is less than $\epsilon$. It is called the $\epsilon$-ball centered
at $x$. Sometimes we omit the metric $d$ from the notation and write this ball simply as
$B(x, \epsilon)$, when no confusion will arise.


Definition. If $d$ is a metric on the set $X$, then the collection of all $\epsilon$-balls $B_d(x, \epsilon)$, for
$x \in X$ and $\epsilon > 0$, is a basis for a topology on $X$, called the metric topology induced
by $d$.


The first condition for a basis is trivial, since $x \in B(x, \epsilon)$ for any $\epsilon > 0$. Before
checking the second condition for a basis, we show that if $y$ is a point of the basis
element $B(x, \epsilon)$, then there is a basis element $B(y, \delta)$ centered at $y$ that is contained
in $B(x, \epsilon)$. Define $\delta$ to be the positive number $\epsilon - d(x, y)$. Then $B(y, \delta) \subset B(x, \epsilon)$,
for if $z \in B(y, \delta)$, then $d(y, z) < \epsilon - d(x, y)$, from which we conclude that

$$d(x, z) \leq d(x, y) + d(y, z) < \epsilon.$$

See Figure 20.1.

Now to check the second condition for a basis, let $B_1$ and $B_2$ be two basis elements
and let $y \in B_1 \cap B_2$. We have just shown that we can choose positive numbers $\delta_1$ and $\delta_2$
so that $B(y, \delta_1) \subset B_1$ and $B(y, \delta_2) \subset B_2$. Letting $\delta$ be the smaller of $\delta_1$ and $\delta_2$, we
conclude that $B(y, \delta) \subset B_1 \cap B_2$.

Using what we have just proved, we can rephrase the definition of the metric topol- 
ogy as follows: 
Figure 20.1


A set $U$ is open in the metric topology induced by $d$ if and only if for each
$y \in U$, there is a $\delta > 0$ such that $B_d(y, \delta) \subset U$.


Clearly this condition implies that $U$ is open. Conversely, if $U$ is open, it contains
a basis element $B = B_d(x, \epsilon)$ containing $y$, and $B$ in turn contains a basis element
$B_d(y, \delta)$ centered at $y$.


EXAMPLE 1. Given a set $X$, define

$$d(x, y) = 1 \quad \text{if } x \neq y,$$
$$d(x, y) = 0 \quad \text{if } x = y.$$

It is trivial to check that $d$ is a metric. The topology it induces is the discrete topology; the
basis element $B(x, 1)$, for example, consists of the point $x$ alone.


EXAMPLE 2. The standard metric on the real numbers $\mathbb{R}$ is defined by the equation

$$d(x, y) = |x - y|.$$

It is easy to check that $d$ is a metric. The topology it induces is the same as the order
topology: Each basis element $(a, b)$ for the order topology is a basis element for the metric
topology; indeed,

$$(a, b) = B(x, \epsilon),$$

where $x = (a + b)/2$ and $\epsilon = (b - a)/2$. And conversely, each $\epsilon$-ball $B(x, \epsilon)$ equals an
open interval: the interval $(x - \epsilon, x + \epsilon)$.


Definition. If $X$ is a topological space, $X$ is said to be metrizable if there exists a
metric $d$ on the set $X$ that induces the topology of $X$. A metric space is a metrizable
space $X$ together with a specific metric $d$ that gives the topology of $X$.


Many of the spaces important for mathematics are metrizable, but some are not.
Metrizability is always a highly desirable attribute for a space to possess, for the exis- 
tence of a metric gives one a valuable tool for proving theorems about the space. 
It is, therefore, a problem of fundamental importance in topology to find conditions
on a topological space that will guarantee it is metrizable. One of our goals in
Chapter 4 will be to find such conditions; they are expressed there in the famous theorem
called Urysohn's metrization theorem. Further metrization theorems appear in
Chapter 6. In the present section we shall content ourselves with proving merely that
$\mathbb{R}^n$ and $\mathbb{R}^\omega$ are metrizable.

Although the metrizability problem is an important problem in topology, the study
of metric spaces as such does not properly belong to topology as much as it does
to analysis. Metrizability of a space depends only on the topology of the space in
question, but properties that involve a specific metric for $X$ in general do not. For
instance, one can make the following definition in a metric space.


Definition. Let $X$ be a metric space with metric $d$. A subset $A$ of $X$ is said to be
bounded if there is some number $M$ such that

$$d(a_1, a_2) \leq M$$

for every pair $a_1, a_2$ of points of $A$. If $A$ is bounded and nonempty, the diameter of $A$
is defined to be the number

$$\operatorname{diam} A = \sup\{d(a_1, a_2) \mid a_1, a_2 \in A\}.$$


Boundedness of a set is not a topological property, for it depends on the particular
metric $d$ that is used for $X$. For instance, if $X$ is a metric space with metric $d$, then
there exists a metric $\bar{d}$ that gives the topology of $X$, relative to which every subset of $X$
is bounded. It is defined as follows:


Theorem 20.1. Let $X$ be a metric space with metric $d$. Define $\bar{d} : X \times X \to \mathbb{R}$ by
the equation

$$\bar{d}(x, y) = \min\{d(x, y), 1\}.$$

Then $\bar{d}$ is a metric that induces the same topology as $d$.


The metric $\bar{d}$ is called the standard bounded metric corresponding to $d$.


Proof. Checking the first two conditions for a metric is trivial. Let us check the
triangle inequality:

$$\bar{d}(x, z) \leq \bar{d}(x, y) + \bar{d}(y, z).$$

Now if either $d(x, y) \geq 1$ or $d(y, z) \geq 1$, then the right side of this inequality is at
least 1; since the left side is (by definition) at most 1, the inequality holds. It remains
to consider the case in which $d(x, y) < 1$ and $d(y, z) < 1$. In this case, we have

$$d(x, z) \leq d(x, y) + d(y, z) = \bar{d}(x, y) + \bar{d}(y, z).$$

Since $\bar{d}(x, z) \leq d(x, z)$ by definition, the triangle inequality holds for $\bar{d}$.
Now we note that in any metric space, the collection of $\epsilon$-balls with $\epsilon < 1$ forms
a basis for the metric topology, for every basis element containing $x$ contains such an
$\epsilon$-ball centered at $x$. It follows that $d$ and $\bar{d}$ induce the same topology on $X$, because
the collections of $\epsilon$-balls with $\epsilon < 1$ under these two metrics are the same collection. ∎
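
The key step of the proof, that balls of radius less than 1 agree under $d$ and $\bar{d}$, can be illustrated on a finite sample. This is a minimal sketch that is not part of the original text; the helper names `bounded` and `ball` and the use of a finite sample in place of the whole space are assumptions made for illustration.

```python
def bounded(d):
    """Return the standard bounded metric min{d(x, y), 1} built from d."""
    return lambda x, y: min(d(x, y), 1.0)


def ball(d, center, eps, sample):
    """Points of the finite `sample` lying in the eps-ball about `center`."""
    return {p for p in sample if d(center, p) < eps}
```

With `d = lambda x, y: abs(x - y)` and a sample of reals, `ball(d, 0.0, eps, sample)` and `ball(bounded(d), 0.0, eps, sample)` coincide whenever `eps < 1`; for `eps > 1` the bounded ball contains every sample point, which is why every subset is bounded relative to $\bar{d}$.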


Now we consider some familiar spaces and show they are metrizable.


Definition. Given $x = (x_1, \ldots, x_n)$ in $\mathbb{R}^n$, we define the norm of $x$ by the equation

$$\|x\| = (x_1^2 + \cdots + x_n^2)^{1/2};$$

and we define the euclidean metric $d$ on $\mathbb{R}^n$ by the equation

$$d(x, y) = \|x - y\| = [(x_1 - y_1)^2 + \cdots + (x_n - y_n)^2]^{1/2}.$$

We define the square metric $\rho$ by the equation

$$\rho(x, y) = \max\{|x_1 - y_1|, \ldots, |x_n - y_n|\}.$$


The proof that $d$ is a metric requires some work; it is probably already familiar to
you. If not, a proof is outlined in the exercises. We shall seldom have occasion to use
this metric on $\mathbb{R}^n$.

To show that $\rho$ is a metric is easier. Only the triangle inequality is nontrivial. From
the triangle inequality for $\mathbb{R}$ it follows that for each positive integer $i$,

$$|x_i - z_i| \leq |x_i - y_i| + |y_i - z_i|.$$

Then by definition of $\rho$,

$$|x_i - z_i| \leq \rho(x, y) + \rho(y, z).$$

As a result

$$\rho(x, z) = \max\{|x_i - z_i|\} \leq \rho(x, y) + \rho(y, z),$$

as desired.

On the real line $\mathbb{R} = \mathbb{R}^1$, these two metrics coincide with the standard metric
for $\mathbb{R}$. In the plane $\mathbb{R}^2$, the basis elements under $d$ can be pictured as circular regions,
while the basis elements under $\rho$ can be pictured as square regions.

We now show that each of these metrics induces the usual topology on $\mathbb{R}^n$. We
need the following lemma:


Lemma 20.2. Let $d$ and $d'$ be two metrics on the set $X$; let $\mathcal{T}$ and $\mathcal{T}'$ be the topologies
they induce, respectively. Then $\mathcal{T}'$ is finer than $\mathcal{T}$ if and only if for each $x$ in $X$ and
each $\epsilon > 0$, there exists a $\delta > 0$ such that

$$B_{d'}(x, \delta) \subset B_d(x, \epsilon).$$
Proof. Suppose that $\mathcal{T}'$ is finer than $\mathcal{T}$. Given the basis element $B_d(x, \epsilon)$ for $\mathcal{T}$, there
is by Lemma 13.3 a basis element $B'$ for the topology $\mathcal{T}'$ such that $x \in B' \subset B_d(x, \epsilon)$.
Within $B'$ we can find a ball $B_{d'}(x, \delta)$ centered at $x$.

Conversely, suppose the $\delta$-$\epsilon$ condition holds. Given a basis element $B$ for $\mathcal{T}$
containing $x$, we can find within $B$ a ball $B_d(x, \epsilon)$ centered at $x$. By the given condition,
there is a $\delta$ such that $B_{d'}(x, \delta) \subset B_d(x, \epsilon)$. Then Lemma 13.3 applies to show $\mathcal{T}'$ is
finer than $\mathcal{T}$. ∎


Theorem 20.3. The topologies on $\mathbb{R}^n$ induced by the euclidean metric $d$ and the
square metric $\rho$ are the same as the product topology on $\mathbb{R}^n$.


Proof. Let $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$ be two points of $\mathbb{R}^n$. It is simple
algebra to check that

$$\rho(x, y) \leq d(x, y) \leq \sqrt{n}\,\rho(x, y).$$

The first inequality shows that

$$B_d(x, \epsilon) \subset B_\rho(x, \epsilon)$$

for all $x$ and $\epsilon$, since if $d(x, y) < \epsilon$, then $\rho(x, y) < \epsilon$ also. Similarly, the second
inequality shows that

$$B_\rho(x, \epsilon/\sqrt{n}) \subset B_d(x, \epsilon)$$

for all $x$ and $\epsilon$. It follows from the preceding lemma that the two metric topologies are
the same.

Now we show that the product topology is the same as that given by the metric $\rho$.
First, let

$$B = (a_1, b_1) \times \cdots \times (a_n, b_n)$$

be a basis element for the product topology, and let $x = (x_1, \ldots, x_n)$ be an element
of $B$. For each $i$, there is an $\epsilon_i$ such that

$$(x_i - \epsilon_i, x_i + \epsilon_i) \subset (a_i, b_i);$$

choose $\epsilon = \min\{\epsilon_1, \ldots, \epsilon_n\}$. Then $B_\rho(x, \epsilon) \subset B$, as you can readily check. As a
result, the $\rho$-topology is finer than the product topology.

Conversely, let $B_\rho(x, \epsilon)$ be a basis element for the $\rho$-topology. Given the element
$y \in B_\rho(x, \epsilon)$, we need to find a basis element $B$ for the product topology such that

$$y \in B \subset B_\rho(x, \epsilon).$$

But this is trivial, for

$$B_\rho(x, \epsilon) = (x_1 - \epsilon, x_1 + \epsilon) \times \cdots \times (x_n - \epsilon, x_n + \epsilon)$$

is itself a basis element for the product topology. ∎
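
The pair of inequalities $\rho(x, y) \leq d(x, y) \leq \sqrt{n}\,\rho(x, y)$ on which the proof rests can be checked numerically for particular points. The sketch below is an illustration only and is not part of the original text; the function names and the rounding tolerance `tol` are assumptions.

```python
import math


def euclidean(x, y):
    """Euclidean metric d on R^n, for points given as equal-length tuples."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def square(x, y):
    """Square metric rho on R^n: the largest coordinate difference."""
    return max(abs(a - b) for a, b in zip(x, y))


def satisfies_sandwich(x, y, tol=1e-12):
    """Check rho(x, y) <= d(x, y) <= sqrt(n) * rho(x, y) up to rounding."""
    n = len(x)
    rho, d = square(x, y), euclidean(x, y)
    return rho <= d + tol and d <= math.sqrt(n) * rho + tol
```

For example, with $x = (0, 0, 0)$ and $y = (3, 4, 0)$ one gets $\rho = 4$ and $d = 5$, and $5 \leq 4\sqrt{3}$. The right-hand bound is attained when all coordinate differences are equal in absolute value, as for $(0, 0)$ and $(1, 1)$.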
Now we consider the infinite cartesian product $\mathbb{R}^\omega$. It is natural to try to generalize
the metrics $d$ and $\rho$ to this space. For instance, one can attempt to define a metric $d$
on $\mathbb{R}^\omega$ by the equation

$$d(x, y) = \Bigl[\sum_{i=1}^{\infty} (x_i - y_i)^2\Bigr]^{1/2}.$$

But this equation does not always make sense, for the series in question need not
converge. (This equation does define a metric on a certain important subset of $\mathbb{R}^\omega$,
however; see the exercises.)

Similarly, one can attempt to generalize the square metric $\rho$ to $\mathbb{R}^\omega$ by defining

$$\rho(x, y) = \sup\{|x_n - y_n|\}.$$

Again, this formula does not always make sense. If however we replace the usual
metric $d(x, y) = |x - y|$ on $\mathbb{R}$ by its bounded counterpart $\bar{d}(x, y) = \min\{|x - y|, 1\}$,
then this definition does make sense; it gives a metric on $\mathbb{R}^\omega$ called the uniform metric.

The uniform metric can be defined more generally on the cartesian product $\mathbb{R}^J$ for
arbitrary $J$, as follows:


Definition. Given an index set $J$, and given points $x = (x_\alpha)_{\alpha \in J}$ and $y = (y_\alpha)_{\alpha \in J}$
of $\mathbb{R}^J$, let us define a metric $\bar{\rho}$ on $\mathbb{R}^J$ by the equation

$$\bar{\rho}(x, y) = \sup\{\bar{d}(x_\alpha, y_\alpha) \mid \alpha \in J\},$$

where $\bar{d}$ is the standard bounded metric on $\mathbb{R}$. It is easy to check that $\bar{\rho}$ is indeed a
metric; it is called the uniform metric on $\mathbb{R}^J$, and the topology it induces is called the
uniform topology.


The relation between this topology and the product and box topologies is the fol- 
lowing: 


Theorem 20.4. The uniform topology on $\mathbb{R}^J$ is finer than the product topology and
coarser than the box topology; these three topologies are all different if $J$ is infinite.


Proof. Suppose that we are given a point $x = (x_\alpha)_{\alpha \in J}$ and a product topology basis
element $\prod U_\alpha$ about $x$. Let $\alpha_1, \ldots, \alpha_n$ be the indices for which $U_\alpha \neq \mathbb{R}$. Then for
each $i$, choose $\epsilon_i > 0$ so that the $\epsilon_i$-ball centered at $x_{\alpha_i}$ in the $\bar{d}$ metric is contained
in $U_{\alpha_i}$; this we can do because $U_{\alpha_i}$ is open in $\mathbb{R}$. Let $\epsilon = \min\{\epsilon_1, \ldots, \epsilon_n\}$; then the
$\epsilon$-ball centered at $x$ in the $\bar{\rho}$ metric is contained in $\prod U_\alpha$. For if $z$ is a point of $\mathbb{R}^J$ such
that $\bar{\rho}(x, z) < \epsilon$, then $\bar{d}(x_\alpha, z_\alpha) < \epsilon$ for all $\alpha$, so that $z \in \prod U_\alpha$. It follows that the
uniform topology is finer than the product topology.

On the other hand, let $B$ be the $\epsilon$-ball centered at $x$ in the $\bar{\rho}$ metric. Then the box
neighborhood

$$U = \prod (x_\alpha - \tfrac{1}{2}\epsilon, x_\alpha + \tfrac{1}{2}\epsilon)$$

of $x$ is contained in $B$. For if $y \in U$, then $\bar{d}(x_\alpha, y_\alpha) < \tfrac{1}{2}\epsilon$ for all $\alpha$, so that $\bar{\rho}(x, y) \leq
\tfrac{1}{2}\epsilon$.

Showing these three topologies are different if $J$ is infinite is a task we leave to
the exercises. ∎


In the case where $J$ is infinite, we still have not determined whether $\mathbb{R}^J$ is metrizable
in either the box or the product topology. It turns out that the only one of these
cases where $\mathbb{R}^J$ is metrizable is the case where $J$ is countable and $\mathbb{R}^J$ has the product
topology, as we shall see.


Theorem 20.5. Let $\bar{d}(a, b) = \min\{|a - b|, 1\}$ be the standard bounded metric on $\mathbb{R}$.
If $x$ and $y$ are two points of $\mathbb{R}^\omega$, define

$$D(x, y) = \sup\Bigl\{\frac{\bar{d}(x_i, y_i)}{i}\Bigr\}.$$

Then $D$ is a metric that induces the product topology on $\mathbb{R}^\omega$.


Proof. The properties of a metric are satisfied trivially except for the triangle inequality,
which is proved by noting that for all $i$,

$$\frac{\bar{d}(x_i, z_i)}{i} \leq \frac{\bar{d}(x_i, y_i)}{i} + \frac{\bar{d}(y_i, z_i)}{i} \leq D(x, y) + D(y, z),$$

so that

$$\sup\Bigl\{\frac{\bar{d}(x_i, z_i)}{i}\Bigr\} \leq D(x, y) + D(y, z).$$


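The fact that $D$ gives the product topology requires a little more work. First, let $U$
be open in the metric topology and let $x \in U$; we find an open set $V$ in the product
topology such that $x \in V \subset U$. Choose an $\epsilon$-ball $B_D(x, \epsilon)$ lying in $U$. Then choose $N$
large enough that $1/N < \epsilon$. Finally, let $V$ be the basis element for the product topology

$$V = (x_1 - \epsilon, x_1 + \epsilon) \times \cdots \times (x_N - \epsilon, x_N + \epsilon) \times \mathbb{R} \times \mathbb{R} \times \cdots.$$

We assert that $V \subset B_D(x, \epsilon)$: Given any $y$ in $\mathbb{R}^\omega$,

$$\frac{\bar{d}(x_i, y_i)}{i} \leq \frac{1}{N} \quad \text{for } i \geq N.$$

Therefore,

$$D(x, y) \leq \max\Bigl\{\frac{\bar{d}(x_1, y_1)}{1}, \ldots, \frac{\bar{d}(x_N, y_N)}{N}, \frac{1}{N}\Bigr\}.$$

If $y$ is in $V$, this expression is less than $\epsilon$, so that $V \subset B_D(x, \epsilon)$, as desired.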
Conversely, consider a basis element

$$U = \prod_{i \in \mathbb{Z}_+} U_i$$

for the product topology, where $U_i$ is open in $\mathbb{R}$ for $i = \alpha_1, \ldots, \alpha_n$ and $U_i = \mathbb{R}$ for all
other indices $i$. Given $x \in U$, we find an open set $V$ of the metric topology such that
$x \in V \subset U$. Choose an interval $(x_i - \epsilon_i, x_i + \epsilon_i)$ in $\mathbb{R}$ centered about $x_i$ and lying
in $U_i$ for $i = \alpha_1, \ldots, \alpha_n$; choose each $\epsilon_i \leq 1$. Then define

$$\epsilon = \min\{\epsilon_i / i \mid i = \alpha_1, \ldots, \alpha_n\}.$$

We assert that

$$x \in B_D(x, \epsilon) \subset U.$$

Let $y$ be a point of $B_D(x, \epsilon)$. Then for all $i$,

$$\frac{\bar{d}(x_i, y_i)}{i} \leq D(x, y) < \epsilon.$$

Now if $i = \alpha_1, \ldots, \alpha_n$, then $\epsilon \leq \epsilon_i / i$, so that $\bar{d}(x_i, y_i) < \epsilon_i \leq 1$; it follows that
$|x_i - y_i| < \epsilon_i$. Therefore, $y \in \prod U_i$, as desired. ∎
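
The first half of the proof uses the fact that coordinates beyond the $N$th contribute at most $1/N$ to $D(x, y)$. The following minimal sketch, which is not part of the original text, turns that observation into computable bounds. It assumes, purely for illustration, that a point of $\mathbb{R}^\omega$ is modeled as a Python function from positive integers to floats; the names `dbar` and `D_bounds` are likewise hypothetical.

```python
def dbar(a, b):
    """Standard bounded metric on R."""
    return min(abs(a - b), 1.0)


def D_bounds(x, y, N):
    """Return (lower, upper) bounds for D(x, y) = sup_i dbar(x_i, y_i) / i.

    `x` and `y` map a positive integer i to the i-th coordinate.
    The lower bound is the maximum over the first N coordinates; the
    upper bound uses dbar(x_i, y_i) / i <= 1/N for all i >= N.
    """
    head = max(dbar(x(i), y(i)) / i for i in range(1, N + 1))
    return head, max(head, 1.0 / N)
```

If the first $N$ coordinates of $x$ and $y$ agree, the bounds read $0 \leq D(x, y) \leq 1/N$, which is the reason a product-topology basis element restricting only finitely many coordinates fits inside a $D$-ball.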


Exercises

1. (a) In $\mathbb{R}^n$, define

       $$d'(x, y) = |x_1 - y_1| + \cdots + |x_n - y_n|.$$

       Show that $d'$ is a metric that induces the usual topology of $\mathbb{R}^n$. Sketch the
       basis elements under $d'$ when $n = 2$.
   (b) More generally, given $p \geq 1$, define

       $$d'(x, y) = \Bigl[\sum_{i=1}^{n} |x_i - y_i|^p\Bigr]^{1/p}$$

       for $x, y \in \mathbb{R}^n$. Assume that $d'$ is a metric. Show that it induces the usual
       topology on $\mathbb{R}^n$.
2. Show that $\mathbb{R} \times \mathbb{R}$ in the dictionary order topology is metrizable.
3. Let $X$ be a metric space with metric $d$.
   (a) Show that $d : X \times X \to \mathbb{R}$ is continuous.
   (b) Let $X'$ denote a space having the same underlying set as $X$. Show that if
       $d : X' \times X' \to \mathbb{R}$ is continuous, then the topology of $X'$ is finer than the
       topology of $X$.
   One can summarize the result of this exercise as follows: If $X$ has a metric $d$,
   then the topology induced by $d$ is the coarsest topology relative to which the
   function $d$ is continuous.


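4. Consider the product, uniform, and box topologies on $\mathbb{R}^\omega$.
   (a) In which topologies are the following functions from $\mathbb{R}$ to $\mathbb{R}^\omega$ continuous?

       $$f(t) = (t, 2t, 3t, \ldots),$$
       $$g(t) = (t, t, t, \ldots),$$
       $$h(t) = (t, \tfrac{1}{2}t, \tfrac{1}{3}t, \ldots).$$

   (b) In which topologies do the following sequences converge?

       $$w_1 = (1, 1, 1, 1, \ldots), \qquad x_1 = (1, 1, 1, 1, \ldots),$$
       $$w_2 = (0, 2, 2, 2, \ldots), \qquad x_2 = (0, \tfrac{1}{2}, \tfrac{1}{2}, \tfrac{1}{2}, \ldots),$$
       $$w_3 = (0, 0, 3, 3, \ldots), \qquad x_3 = (0, 0, \tfrac{1}{3}, \tfrac{1}{3}, \ldots),$$
       $$\ldots$$

       $$y_1 = (1, 0, 0, 0, \ldots), \qquad z_1 = (1, 1, 0, 0, \ldots),$$
       $$y_2 = (\tfrac{1}{2}, \tfrac{1}{2}, 0, 0, \ldots), \qquad z_2 = (\tfrac{1}{2}, \tfrac{1}{2}, 0, 0, \ldots),$$
       $$y_3 = (\tfrac{1}{3}, \tfrac{1}{3}, \tfrac{1}{3}, 0, \ldots), \qquad z_3 = (\tfrac{1}{3}, \tfrac{1}{3}, 0, 0, \ldots),$$
       $$\ldots$$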


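5. Let $\mathbb{R}^\infty$ be the subset of $\mathbb{R}^\omega$ consisting of all sequences that are eventually zero.
   What is the closure of $\mathbb{R}^\infty$ in $\mathbb{R}^\omega$ in the uniform topology? Justify your answer.
6. Let $\bar{\rho}$ be the uniform metric on $\mathbb{R}^\omega$. Given $x = (x_1, x_2, \ldots) \in \mathbb{R}^\omega$ and given
   $0 < \epsilon < 1$, let

   $$U(x, \epsilon) = (x_1 - \epsilon, x_1 + \epsilon) \times \cdots \times (x_n - \epsilon, x_n + \epsilon) \times \cdots.$$

   (a) Show that $U(x, \epsilon)$ is not equal to the $\epsilon$-ball $B_{\bar{\rho}}(x, \epsilon)$.
   (b) Show that $U(x, \epsilon)$ is not even open in the uniform topology.
   (c) Show that

       $$B_{\bar{\rho}}(x, \epsilon) = \bigcup_{\delta < \epsilon} U(x, \delta).$$

7. Consider the map $h : \mathbb{R}^\omega \to \mathbb{R}^\omega$ defined in Exercise 8 of §19; give $\mathbb{R}^\omega$ the uniform
   topology. Under what conditions on the numbers $a_i$ and $b_i$ is $h$ continuous?
   a homeomorphism?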


8. Let $X$ be the subset of $\mathbb{R}^\omega$ consisting of all sequences $x$ such that $\sum x_i^2$ converges.
   Then the formula

   $$d(x, y) = \Bigl[\sum_{i=1}^{\infty} (x_i - y_i)^2\Bigr]^{1/2}$$

   defines a metric on $X$. (See Exercise 10.) On $X$ we have the three topologies it
   inherits from the box, uniform, and product topologies on $\mathbb{R}^\omega$. We have also the
   topology given by the metric $d$, which we call the $\ell^2$-topology. (Read "little ell
   two.")
   (a) Show that on $X$, we have the inclusions

       box topology $\supset$ $\ell^2$-topology $\supset$ uniform topology.

   (b) The set $\mathbb{R}^\infty$ of all sequences that are eventually zero is contained in $X$. Show
       that the four topologies that $\mathbb{R}^\infty$ inherits as a subspace of $X$ are all distinct.
   (c) The set

       $$H = \prod_{n \in \mathbb{Z}_+} [0, 1/n]$$

       is contained in $X$; it is called the Hilbert cube. Compare the four topologies
       that $H$ inherits as a subspace of $X$.


9. Show that the euclidean metric $d$ on $\mathbb{R}^n$ is a metric, as follows: If $x, y \in \mathbb{R}^n$ and
   $c \in \mathbb{R}$, define

   $$x + y = (x_1 + y_1, \ldots, x_n + y_n),$$
   $$cx = (cx_1, \ldots, cx_n),$$
   $$x \cdot y = x_1 y_1 + \cdots + x_n y_n.$$

   (a) Show that $x \cdot (y + z) = (x \cdot y) + (x \cdot z)$.
   (b) Show that $|x \cdot y| \leq \|x\| \|y\|$. [Hint: If $x, y \neq 0$, let $a = 1/\|x\|$ and $b = 1/\|y\|$,
       and use the fact that $\|ax \pm by\| \geq 0$.]
   (c) Show that $\|x + y\| \leq \|x\| + \|y\|$. [Hint: Compute $(x + y) \cdot (x + y)$ and
       apply (b).]
   (d) Verify that $d$ is a metric.
10. Let $X$ denote the subset of $\mathbb{R}^\omega$ consisting of all sequences $(x_1, x_2, \ldots)$ such that
    $\sum x_i^2$ converges. (You may assume the standard facts about infinite series. In
    case they are not familiar to you, we shall give them in Exercise 11 of the next
    section.)
    (a) Show that if $x, y \in X$, then $\sum |x_i y_i|$ converges. [Hint: Use (b) of Exercise 9
        to show that the partial sums are bounded.]
    (b) Let $c \in \mathbb{R}$. Show that if $x, y \in X$, then so are $x + y$ and $cx$.
    (c) Show that

        $$d(x, y) = \Bigl[\sum_{i=1}^{\infty} (x_i - y_i)^2\Bigr]^{1/2}$$

        is a well-defined metric on $X$.
*11. Show that if $d$ is a metric for $X$, then

     $$d'(x, y) = d(x, y)/(1 + d(x, y))$$

     is a bounded metric that gives the topology of $X$. [Hint: If $f(x) = x/(1 + x)$ for
     $x > 0$, use the mean-value theorem to show that $f(a + b) - f(b) \leq f(a)$.]


§21 The Metric Topology (continued) 


In this section, we discuss the relation of the metric topology to the concepts we have 
previously introduced. 

Subspaces of metric spaces behave the way one would wish them to; if A is a 
subspace of the topological space X and d is a metric for X, then the restriction of d 
to A x A is a metric for the topology of A. This we leave to you to check. 

About order topologies there is nothing to be said; some are metrizable (for
instance, $\mathbb{Z}_+$ and $\mathbb{R}$), and others are not, as we shall see.

The Hausdorff axiom is satisfied by every metric topology. If $x$ and $y$ are distinct
points of the metric space $(X, d)$, we let $\epsilon = \tfrac{1}{2} d(x, y)$; then the triangle inequality
implies that $B_d(x, \epsilon)$ and $B_d(y, \epsilon)$ are disjoint.

The product topology we have already considered in special cases; we have proved
that the products $\mathbb{R}^n$ and $\mathbb{R}^\omega$ are metrizable. It is true in general that countable products
of metrizable spaces are metrizable; the proof follows a pattern similar to the proof
for $\mathbb{R}^\omega$, so we leave it to the exercises.

About continuous functions there is a good deal to be said. Consideration of this 
topic will occupy the remainder of the section. 

When we study continuous functions on metric spaces, we are about as close to 
the study of calculus and analysis as we shall come in this book. There are two things 
we want to do at this point. 

First, we want to show that the familiar "$\epsilon$-$\delta$ definition" of continuity carries over
to general metric spaces, and so does the "convergent sequence definition" of
continuity.

Second, we want to consider two additional methods for constructing continuous
functions, besides those discussed in §18. One is the process of taking sums,
differences, products, and quotients of continuous real-valued functions. The other is the
process of taking limits of uniformly convergent sequences of continuous functions.


Theorem 21.1. Let $f : X \to Y$; let $X$ and $Y$ be metrizable with metrics $d_X$ and $d_Y$,
respectively. Then continuity of $f$ is equivalent to the requirement that given $x \in X$
and given $\epsilon > 0$, there exists $\delta > 0$ such that

$$d_X(x, y) < \delta \implies d_Y(f(x), f(y)) < \epsilon.$$


Proof. Suppose that $f$ is continuous. Given $x$ and $\epsilon$, consider the set

$$f^{-1}(B(f(x), \epsilon)),$$

which is open in $X$ and contains the point $x$. It contains some $\delta$-ball $B(x, \delta)$ centered
at $x$. If $y$ is in this $\delta$-ball, then $f(y)$ is in the $\epsilon$-ball centered at $f(x)$, as desired.

Conversely, suppose that the $\epsilon$-$\delta$ condition is satisfied. Let $V$ be open in $Y$; we
show that $f^{-1}(V)$ is open in $X$. Let $x$ be a point of the set $f^{-1}(V)$. Since $f(x) \in V$,
there is an $\epsilon$-ball $B(f(x), \epsilon)$ centered at $f(x)$ and contained in $V$. By the $\epsilon$-$\delta$
condition, there is a $\delta$-ball $B(x, \delta)$ centered at $x$ such that $f(B(x, \delta)) \subset B(f(x), \epsilon)$.
Then $B(x, \delta)$ is a neighborhood of $x$ contained in $f^{-1}(V)$, so that $f^{-1}(V)$ is open, as
desired. ∎
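
As a concrete instance of the $\epsilon$-$\delta$ condition, take $X = Y = \mathbb{R}$ with the standard metric and $f(x) = x^2$. The sketch below is an illustration that is not part of the original text; the function name `delta_for_square` and the particular choice of $\delta$ are assumptions. The choice works because $|x^2 - y^2| = |x - y|\,|x + y| < \delta(2|x| + \delta) \leq \delta(2|x| + 1) \leq \epsilon$ whenever $|x - y| < \delta \leq 1$.

```python
def delta_for_square(x, eps):
    """Return a delta > 0 such that |x - y| < delta implies
    |x**2 - y**2| < eps, for the map f(t) = t**2 on the real line."""
    if eps <= 0:
        raise ValueError("eps must be positive")
    # delta <= 1 guarantees |x + y| < 2|x| + 1 inside the delta-ball.
    return min(1.0, eps / (2 * abs(x) + 1))
```

Note that the $\delta$ produced depends on the point $x$ as well as on $\epsilon$, exactly as the statement of Theorem 21.1 allows.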


Now we turn to the convergent sequence definition of continuity. We begin by 
considering the relation between convergent sequences and closures of sets. It is cer- 
tainly believable, from one’s experience in analysis, that if x lies in the closure of a 
subset A of the space X, then there should exist a sequence of points of A converging 
to x. This is not true in general, but it is true for metrizable spaces. 


Lemma 21.2 (The sequence lemma). Let $X$ be a topological space; let $A \subset X$. If
there is a sequence of points of $A$ converging to $x$, then $x \in \bar{A}$; the converse holds if $X$
is metrizable.


Proof. Suppose that $x_n \to x$, where $x_n \in A$. Then every neighborhood $U$ of $x$
contains a point of $A$, so $x \in \bar{A}$ by Theorem 17.5. Conversely, suppose that $X$ is
metrizable and $x \in \bar{A}$. Let $d$ be a metric for the topology of $X$. For each positive
integer $n$, take the neighborhood $B_d(x, 1/n)$ of radius $1/n$ of $x$, and choose $x_n$ to be
a point of its intersection with $A$. We assert that the sequence $x_n$ converges to $x$: Any
open set $U$ containing $x$ contains an $\epsilon$-ball $B_d(x, \epsilon)$ centered at $x$; if we choose $N$ so
that $1/N < \epsilon$, then $U$ contains $x_i$ for all $i \geq N$. ∎


Theorem 21.3. Let $f : X \to Y$. If the function $f$ is continuous, then for every
convergent sequence $x_n \to x$ in $X$, the sequence $f(x_n)$ converges to $f(x)$. The converse
holds if $X$ is metrizable.


Proof. Assume that $f$ is continuous. Given $x_n \to x$, we wish to show that $f(x_n) \to
f(x)$. Let $V$ be a neighborhood of $f(x)$. Then $f^{-1}(V)$ is a neighborhood of $x$, and so
there is an $N$ such that $x_n \in f^{-1}(V)$ for $n \geq N$. Then $f(x_n) \in V$ for $n \geq N$.

To prove the converse, assume that the convergent sequence condition is satisfied.
Let $A$ be a subset of $X$; we show that $f(\bar{A}) \subset \overline{f(A)}$. If $x \in \bar{A}$, then there is a
sequence $x_n$ of points of $A$ converging to $x$ (by the preceding lemma). By assumption,
the sequence $f(x_n)$ converges to $f(x)$. Since $f(x_n) \in f(A)$, the preceding lemma
implies that $f(x) \in \overline{f(A)}$. (Note that metrizability of $Y$ is not needed.) Hence $f(\bar{A}) \subset
\overline{f(A)}$, as desired. ∎


Incidentally, in proving Lemma 21.2 and Theorem 21.3 we did not use the full strength
of the hypothesis that the space $X$ is metrizable. All we really needed was the countable
collection $B_d(x, 1/n)$ of balls about $x$. This fact leads us to make a new definition.

A space $X$ is said to have a countable basis at the point $x$ if there is a countable
collection $\{U_n\}_{n \in \mathbb{Z}_+}$ of neighborhoods of $x$ such that any neighborhood $U$ of $x$ contains at
least one of the sets $U_n$. A space $X$ that has a countable basis at each of its points is said to
satisfy the first countability axiom.

If $X$ has a countable basis $\{U_n\}$ at $x$, then the proof of Lemma 21.2 goes through; one
simply replaces the ball $B_d(x, 1/n)$ throughout by the set

\[ B_n = U_1 \cap U_2 \cap \cdots \cap U_n. \]

The proof of Theorem 21.3 goes through unchanged.

A metrizable space always satisfies the first countability axiom, but the converse is not 
true, as we shall see. Like the Hausdorff axiom, the first countability axiom is a requirement 
that we sometimes impose on a topological space in order to prove stronger theorems about 
the space. We shall study it in more detail in Chapter 4. 


Now we consider additional methods for constructing continuous functions. We 
need the following lemma: 


Lemma 21.4. The addition, subtraction, and multiplication operations are continuous
functions from $\mathbb{R} \times \mathbb{R}$ into $\mathbb{R}$; and the quotient operation is a continuous function
from $\mathbb{R} \times (\mathbb{R} - \{0\})$ into $\mathbb{R}$.


You have probably seen this lemma proved before; it is a standard "$\epsilon$-$\delta$ argument."
If not, a proof is outlined in Exercise 12 below; you should have no trouble filling in
the details.


Theorem 21.5. If $X$ is a topological space, and if $f, g : X \to \mathbb{R}$ are continuous
functions, then $f + g$, $f - g$, and $f \cdot g$ are continuous. If $g(x) \neq 0$ for all $x$, then $f/g$
is continuous.


Proof. The map $h : X \to \mathbb{R} \times \mathbb{R}$ defined by

\[ h(x) = f(x) \times g(x) \]

is continuous, by Theorem 18.4. The function $f + g$ equals the composite of $h$ and
the addition operation

\[ + : \mathbb{R} \times \mathbb{R} \to \mathbb{R}; \]

therefore $f + g$ is continuous. Similar arguments apply to $f - g$, $f \cdot g$, and $f/g$. ∎


Finally, we come to the notion of uniform convergence. 


Definition. Let $f_n : X \to Y$ be a sequence of functions from the set $X$ to the metric
space $Y$. Let $d$ be the metric for $Y$. We say that the sequence $(f_n)$ converges uniformly
to the function $f : X \to Y$ if given $\epsilon > 0$, there exists an integer $N$ such that

\[ d(f_n(x), f(x)) < \epsilon \]

for all $n > N$ and all $x$ in $X$.
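
The role of the phrase "for all $x$ in $X$" can be seen numerically. The following is only a rough sketch, not part of the text's development: it takes $Y = \mathbb{R}$ with the usual metric, replaces $X = [0, 1]$ by a finite grid (so the computed maximum can only underestimate the true supremum), and uses two illustrative sequences chosen here as assumptions, $x/n$ and $nx/(1 + nx)$. The helper names are hypothetical.

```python
def sup_distance(f, g, points):
    """Largest value of |f(x) - g(x)| over the sample points.

    On a finite grid this is a lower bound for the true supremum over X.
    """
    return max(abs(f(x) - g(x)) for x in points)


# Finite stand-in for X = [0, 1]; both endpoints are included exactly.
grid = [k / 10000 for k in range(10001)]


def scaled(n):
    """f_n(x) = x / n, which converges uniformly to the zero function on [0, 1]."""
    return lambda x: x / n


def ramp(n):
    """f_n(x) = n x / (1 + n x), which converges pointwise but not uniformly on [0, 1]."""
    return lambda x: n * x / (1 + n * x)


def zero(x):
    return 0.0


def ramp_limit(x):
    """Pointwise limit of ramp(n): 0 at x = 0 and 1 for x > 0."""
    return 0.0 if x == 0 else 1.0


for n in (1, 10, 100):
    print(n, sup_distance(scaled(n), zero, grid), sup_distance(ramp(n), ramp_limit, grid))
```

For the first sequence the sampled distance is $1/n$, so a single $N$ works for every $x$ at once. For the second, the sampled distance stays close to 1 for the values of $n$ shown, because points near 0 always lag behind; the true supremum over $(0, 1]$ equals 1 for every $n$.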


Uniformity of convergence depends not only on the topology of $Y$ but also on its
metric. We have the following theorem about uniformly convergent sequences:

Theorem 21.6 (Uniform limit theorem). Let $f_n : X \to Y$ be a sequence of continuous
functions from the topological space $X$ to the metric space $Y$. If $(f_n)$ converges
uniformly to $f$, then $f$ is continuous.


Proof. Let $V$ be open in $Y$; let $x_0$ be a point of $f^{-1}(V)$. We wish to find a neighborhood
$U$ of $x_0$ such that $f(U) \subset V$.

Let $y_0 = f(x_0)$. First choose $\epsilon$ so that the $\epsilon$-ball $B(y_0, \epsilon)$ is contained in $V$. Then,
using uniform convergence, choose $N$ so that for all $n \geq N$ and all $x \in X$,

\[ d(f_n(x), f(x)) < \epsilon/3. \]

Finally, using continuity of $f_N$, choose a neighborhood $U$ of $x_0$ such that $f_N$ carries $U$
into the $\epsilon/3$ ball in $Y$ centered at $f_N(x_0)$.

We claim that $f$ carries $U$ into $B(y_0, \epsilon)$ and hence into $V$, as desired. For this
purpose, note that if $x \in U$, then

\[
\begin{aligned}
d(f(x), f_N(x)) &< \epsilon/3 && \text{(by choice of } N\text{)},\\
d(f_N(x), f_N(x_0)) &< \epsilon/3 && \text{(by choice of } U\text{)},\\
d(f_N(x_0), f(x_0)) &< \epsilon/3 && \text{(by choice of } N\text{)}.
\end{aligned}
\]

Adding and using the triangle inequality, we see that $d(f(x), f(x_0)) < \epsilon$, as
desired. ∎


Let us remark that the notion of uniform convergence is related to the definition of
the uniform metric, which we gave in the preceding section. Consider, for example,
the space $\mathbb{R}^X$ of all functions $f : X \to \mathbb{R}$, in the uniform metric $\bar{\rho}$. It is not difficult to
see that a sequence of functions $f_n : X \to \mathbb{R}$ converges uniformly to $f$ if and only if
the sequence $(f_n)$ converges to $f$ when they are considered as elements of the metric
space $(\mathbb{R}^X, \bar{\rho})$. We leave the proof to the exercises.

We conclude the section with some examples of spaces that are not metrizable. 

EXAMPLE 1. $\mathbb{R}^\omega$ in the box topology is not metrizable.

We shall show that the sequence lemma does not hold for $\mathbb{R}^\omega$. Let $A$ be the subset of
$\mathbb{R}^\omega$ consisting of those points all of whose coordinates are positive:

\[ A = \{(x_1, x_2, \ldots) \mid x_i > 0 \text{ for all } i \in \mathbb{Z}_+\}. \]

Let $\mathbf{0}$ be the "origin" in $\mathbb{R}^\omega$, that is, the point $(0, 0, \ldots)$ each of whose coordinates is zero.
In the box topology, $\mathbf{0}$ belongs to $\bar{A}$; for if

\[ B = (a_1, b_1) \times (a_2, b_2) \times \cdots \]

is any basis element containing $\mathbf{0}$, then $B$ intersects $A$. For instance, the point

\[ (\tfrac{1}{2} b_1, \tfrac{1}{2} b_2, \ldots) \]

belongs to $B \cap A$.
But we assert that there is no sequence of points of $A$ converging to $\mathbf{0}$. For let $(a_n)$ be
a sequence of points of $A$, where

\[ a_n = (x_{1n}, x_{2n}, \ldots, x_{in}, \ldots). \]

Every coordinate $x_{in}$ is positive, so we can construct a basis element $B'$ for the box
topology on $\mathbb{R}^\omega$ by setting

\[ B' = (-x_{11}, x_{11}) \times (-x_{22}, x_{22}) \times \cdots. \]

Then $B'$ contains the origin $\mathbf{0}$, but it contains no member of the sequence $(a_n)$; the
point $a_n$ cannot belong to $B'$ because its $n$th coordinate $x_{nn}$ does not belong to the interval
$(-x_{nn}, x_{nn})$. Hence the sequence $(a_n)$ cannot converge to $\mathbf{0}$ in the box topology.


EXAMPLE 2. An uncountable product of $\mathbb{R}$ with itself is not metrizable.

Let $J$ be an uncountable index set; we show that $\mathbb{R}^J$ does not satisfy the sequence
lemma (in the product topology).

Let $A$ be the subset of $\mathbb{R}^J$ consisting of all points $(x_\alpha)$ such that $x_\alpha = 1$ for all but
finitely many values of $\alpha$. Let $\mathbf{0}$ be the "origin" in $\mathbb{R}^J$, the point each of whose coordinates
is 0.

We assert that $\mathbf{0}$ belongs to the closure of $A$. Let $\prod U_\alpha$ be a basis element containing $\mathbf{0}$.
Then $U_\alpha \neq \mathbb{R}$ for only finitely many values of $\alpha$, say for $\alpha = \alpha_1, \ldots, \alpha_n$. Let $(x_\alpha)$ be the
point of $A$ defined by letting $x_\alpha = 0$ for $\alpha = \alpha_1, \ldots, \alpha_n$ and $x_\alpha = 1$ for all other values of
$\alpha$; then $(x_\alpha) \in A \cap \prod U_\alpha$, as desired.

But there is no sequence of points of $A$ converging to $\mathbf{0}$. For let $a_n$ be a sequence of
points of $A$. Given $n$, let $J_n$ denote the subset of $J$ consisting of those indices $\alpha$ for which
the $\alpha$th coordinate of $a_n$ is different from 1. The union of all the sets $J_n$ is a countable
union of finite sets and therefore countable. Because $J$ itself is uncountable, there is an
index in $J$, say $\beta$, that does not lie in any of the sets $J_n$. This means that for each of the
points $a_n$, its $\beta$th coordinate equals 1.

Now let $U_\beta$ be the open interval $(-1, 1)$ in $\mathbb{R}$, and let $U$ be the open set $\pi_\beta^{-1}(U_\beta)$
in $\mathbb{R}^J$. The set $U$ is a neighborhood of $\mathbf{0}$ that contains none of the points $a_n$; therefore, the
sequence $a_n$ cannot converge to $\mathbf{0}$.


Exercises 


1. Let $A \subset X$. If $d$ is a metric for the topology of $X$, show that $d \mid A \times A$ is a metric
for the subspace topology on $A$.


2. Let $X$ and $Y$ be metric spaces with metrics $d_X$ and $d_Y$, respectively. Let
$f : X \to Y$ have the property that for every pair of points $x_1$, $x_2$ of $X$,

\[ d_Y(f(x_1), f(x_2)) = d_X(x_1, x_2). \]

Show that $f$ is an imbedding. It is called an isometric imbedding of $X$ in $Y$.


3. Let $X_n$ be a metric space with metric $d_n$, for $n \in \mathbb{Z}_+$.
(a) Show that

\[ \rho(x, y) = \max\{d_1(x_1, y_1), \ldots, d_n(x_n, y_n)\} \]

is a metric for the product space $X_1 \times \cdots \times X_n$.
(b) Let $\bar{d}_i = \min\{d_i, 1\}$. Show that

\[ D(x, y) = \sup\{\bar{d}_i(x_i, y_i)/i\} \]

is a metric for the product space $\prod X_i$.


4. Show that $\mathbb{R}_\ell$ and the ordered square satisfy the first countability axiom. (This
result does not, of course, imply that they are metrizable.)


5. Theorem. Let $x_n \to x$ and $y_n \to y$ in the space $\mathbb{R}$. Then

\[
\begin{aligned}
x_n + y_n &\to x + y,\\
x_n - y_n &\to x - y,\\
x_n y_n &\to xy,
\end{aligned}
\]

and provided that each $y_n \neq 0$ and $y \neq 0$,

\[ x_n / y_n \to x/y. \]

[Hint: Apply Lemma 21.4; recall from the exercises of §19 that if $x_n \to x$ and
$y_n \to y$, then $x_n \times y_n \to x \times y$.]


6. Define $f_n : [0, 1] \to \mathbb{R}$ by the equation $f_n(x) = x^n$. Show that the sequence
$(f_n(x))$ converges for each $x \in [0, 1]$, but that the sequence $(f_n)$ does not converge
uniformly.


7. Let $X$ be a set, and let $f_n : X \to \mathbb{R}$ be a sequence of functions. Let $\bar{\rho}$ be
the uniform metric on the space $\mathbb{R}^X$. Show that the sequence $(f_n)$ converges
uniformly to the function $f : X \to \mathbb{R}$ if and only if the sequence $(f_n)$ converges
to $f$ as elements of the metric space $(\mathbb{R}^X, \bar{\rho})$.


8. Let $X$ be a topological space and let $Y$ be a metric space. Let $f_n : X \to Y$
be a sequence of continuous functions. Let $x_n$ be a sequence of points of $X$
converging to $x$. Show that if the sequence $(f_n)$ converges uniformly to $f$, then
$(f_n(x_n))$ converges to $f(x)$.


9. Let $f_n : \mathbb{R} \to \mathbb{R}$ be the function

\[ f_n(x) = \frac{1}{n^3 [x - (1/n)]^2 + 1}. \]

See Figure 21.1. Let $f : \mathbb{R} \to \mathbb{R}$ be the zero function.

(a) Show that $f_n(x) \to f(x)$ for each $x \in \mathbb{R}$.

(b) Show that $f_n$ does not converge uniformly to $f$. (This shows that the converse
of Theorem 21.6 does not hold; the limit function $f$ may be continuous
even though the convergence is not uniform.)

Figure 21.1

10. Using the closed set formulation of continuity (Theorem 18.1), show that the
following are closed subsets of $\mathbb{R}^2$:

\[
\begin{aligned}
A &= \{x \times y \mid xy = 1\},\\
S^1 &= \{x \times y \mid x^2 + y^2 = 1\},\\
B^2 &= \{x \times y \mid x^2 + y^2 \leq 1\}.
\end{aligned}
\]

The set $B^2$ is called the (closed) unit ball in $\mathbb{R}^2$.
11. Prove the following standard facts about infinite series:
(a) Show that if $(s_n)$ is a bounded sequence of real numbers and $s_n \leq s_{n+1}$ for
each $n$, then $(s_n)$ converges.
(b) Let $(a_n)$ be a sequence of real numbers; define

\[ s_n = \sum_{i=1}^{n} a_i. \]

If $s_n \to s$, we say that the infinite series

\[ \sum_{i=1}^{\infty} a_i \]

converges to $s$ also. Show that if $\sum a_i$ converges to $s$ and $\sum b_i$ converges
to $t$, then $\sum (c a_i + b_i)$ converges to $cs + t$.

(c) Prove the comparison test for infinite series: If $|a_i| \leq b_i$ for each $i$, and if
the series $\sum b_i$ converges, then the series $\sum a_i$ converges. [Hint: Show that
the series $\sum |a_i|$ and $\sum c_i$ converge, where $c_i = |a_i| + a_i$.]

(d) Given a sequence of functions $f_n : X \to \mathbb{R}$, let

\[ s_n(x) = \sum_{i=1}^{n} f_i(x). \]

Prove the Weierstrass M-test for uniform convergence: If $|f_i(x)| \leq M_i$ for
all $x \in X$ and all $i$, and if the series $\sum M_i$ converges, then the sequence $(s_n)$
converges uniformly to a function $s$. [Hint: Let $r_n = \sum_{i=n+1}^{\infty} M_i$. Show
that if $k > n$, then $|s_k(x) - s_n(x)| \leq r_n$; conclude that $|s(x) - s_n(x)| \leq r_n$.]

12. Prove continuity of the algebraic operations on $\mathbb{R}$, as follows: Use the metric
$d(a, b) = |a - b|$ on $\mathbb{R}$ and the metric on $\mathbb{R}^2$ given by the equation

\[ \rho((x, y), (x_0, y_0)) = \max\{|x - x_0|, |y - y_0|\}. \]




(a) Show that addition is continuous. [Hint: Given $\epsilon$, let $\delta = \epsilon/2$ and note that

\[ d(x + y, x_0 + y_0) \leq |x - x_0| + |y - y_0|.] \]

(b) Show that multiplication is continuous. [Hint: Given $(x_0, y_0)$ and $0 < \epsilon < 1$,
let

\[ 3\delta = \epsilon / (|x_0| + |y_0| + 1) \]

and note that

\[ d(xy, x_0 y_0) \leq |x_0||y - y_0| + |y_0||x - x_0| + |x - x_0||y - y_0|.] \]

(c) Show that the operation of taking reciprocals is a continuous map from
$\mathbb{R} - \{0\}$ to $\mathbb{R}$. [Hint: Show the inverse image of the interval $(a, b)$ is open.
Consider five cases, according as $a$ and $b$ are positive, negative, or zero.]

(d) Show that the subtraction and quotient operations are continuous. 


*§22 The Quotient Topology†


Unlike the topologies we have already considered in this chapter, the quotient topology 
is not a natural generalization of something you have already studied in analysis. Nev- 
ertheless, it is easy enough to motivate. One motivation comes from geometry, where 
one often has occasion to use “cut-and-paste” techniques to construct such geometric 
objects as surfaces. The torus (surface of a doughnut), for example, can be constructed 
by taking a rectangle and “pasting” its edges together appropriately, as in Figure 22.1. 
And the sphere (surface of a ball) can be constructed by taking a disc and collapsing 
its entire boundary to a single point; see Figure 22.2. Formalizing these constructions 
involves the concept of quotient topology. 


Figure 22.1 


†This section will be used throughout Part II of the book. It also is referred to in a number of
exercises of Part I.

Figure 22.2


Definition. Let $X$ and $Y$ be topological spaces; let $p : X \to Y$ be a surjective map.
The map $p$ is said to be a quotient map provided a subset $U$ of $Y$ is open in $Y$ if and
only if $p^{-1}(U)$ is open in $X$.


This condition is stronger than continuity; some mathematicians call it "strong
continuity." An equivalent condition is to require that a subset $A$ of $Y$ be closed in $Y$
if and only if $p^{-1}(A)$ is closed in $X$. Equivalence of the two conditions follows from
the equation

\[ f^{-1}(Y - B) = X - f^{-1}(B). \]


Another way of describing a quotient map is as follows: We say that a subset $C$
of $X$ is saturated (with respect to the surjective map $p : X \to Y$) if $C$ contains every
set $p^{-1}(\{y\})$ that it intersects. Thus $C$ is saturated if it equals the complete inverse
image of a subset of $Y$. To say that $p$ is a quotient map is equivalent to saying that $p$ is
continuous and $p$ maps saturated open sets of $X$ to open sets of $Y$ (or saturated closed
sets of $X$ to closed sets of $Y$).
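
For maps between finite sets, saturation can be checked mechanically, since $C$ is saturated exactly when $C = p^{-1}(p(C))$. The sketch below is only an illustration under assumptions made here: the map `p` is stored as a Python dictionary on a hypothetical four-point set, and the function names are not from the text.

```python
def preimage(p, subset_of_y):
    """Return p^{-1}(B) for a map p given as a dict from points of X to points of Y."""
    return {x for x in p if p[x] in subset_of_y}


def image(p, subset_of_x):
    """Return p(C) for a subset C of the domain of p."""
    return {p[x] for x in subset_of_x}


def is_saturated(p, c):
    """C is saturated with respect to p exactly when C equals p^{-1}(p(C))."""
    return preimage(p, image(p, c)) == set(c)


# Hypothetical example: the points 0 and 1 are identified, 2 and 3 are left alone.
p = {0: "a", 1: "a", 2: "b", 3: "c"}

print(is_saturated(p, {0, 1}))  # contains the whole fiber over "a"
print(is_saturated(p, {0}))     # meets the fiber over "a" without containing it
```

The first set is saturated and the second is not; a union of full fibers such as `{2, 3}` is saturated as well.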

Two special kinds of quotient maps are the open maps and the closed maps. Recall
that a map $f : X \to Y$ is said to be an open map if for each open set $U$ of $X$, the
set $f(U)$ is open in $Y$. It is said to be a closed map if for each closed set $A$ of $X$, the
set $f(A)$ is closed in $Y$. It follows immediately from the definition that if $p : X \to Y$
is a surjective continuous map that is either open or closed, then $p$ is a quotient map.
There are quotient maps that are neither open nor closed. (See Exercise 3.)


EXAMPLE 1. Let $X$ be the subspace $[0, 1] \cup [2, 3]$ of $\mathbb{R}$, and let $Y$ be the subspace $[0, 2]$
of $\mathbb{R}$. The map $p : X \to Y$ defined by

\[
p(x) = \begin{cases} x & \text{for } x \in [0, 1],\\ x - 1 & \text{for } x \in [2, 3] \end{cases}
\]

is readily seen to be surjective, continuous, and closed. Therefore it is a quotient map. It is
not, however, an open map; the image of the open set $[0, 1]$ of $X$ is not open in $Y$.

Note that if $A$ is the subspace $[0, 1) \cup [2, 3]$ of $X$, then the map $q : A \to Y$ obtained
by restricting $p$ is continuous and surjective, but it is not a quotient map. For the set $[2, 3]$
is open in $A$ and is saturated with respect to $q$, but its image is not open in $Y$.

EXAMPLE 2. Let $\pi_1 : \mathbb{R} \times \mathbb{R} \to \mathbb{R}$ be projection onto the first coordinate; then $\pi_1$ is
continuous and surjective. Furthermore, $\pi_1$ is an open map. For if $U \times V$ is a nonempty
basis element for $\mathbb{R} \times \mathbb{R}$, then $\pi_1(U \times V) = U$ is open in $\mathbb{R}$; it follows that $\pi_1$ carries open
sets of $\mathbb{R} \times \mathbb{R}$ to open sets of $\mathbb{R}$. However, $\pi_1$ is not a closed map. The subset

\[ C = \{x \times y \mid xy = 1\} \]

of $\mathbb{R} \times \mathbb{R}$ is closed, but $\pi_1(C) = \mathbb{R} - \{0\}$, which is not closed in $\mathbb{R}$.
Note that if $A$ is the subspace of $\mathbb{R} \times \mathbb{R}$ that is the union of $C$ and the origin $\{0\}$, then
the map $q : A \to \mathbb{R}$ obtained by restricting $\pi_1$ is continuous and surjective, but it is not a
quotient map. For the one-point set $\{0\}$ is open in $A$ and is saturated with respect to $q$, but
its image is not open in $\mathbb{R}$.

Now we show how the notion of quotient map can be used to construct a topology 
on a set. 


Definition. If $X$ is a space and $A$ is a set and if $p : X \to A$ is a surjective map, then
there exists exactly one topology $\mathcal{T}$ on $A$ relative to which $p$ is a quotient map; it is
called the quotient topology induced by $p$.

The topology $\mathcal{T}$ is of course defined by letting it consist of those subsets $U$ of $A$
such that $p^{-1}(U)$ is open in $X$. It is easy to check that $\mathcal{T}$ is a topology. The sets $\varnothing$
and $A$ are open because $p^{-1}(\varnothing) = \varnothing$ and $p^{-1}(A) = X$. The other two conditions
follow from the equations

\[
\begin{aligned}
p^{-1}\Bigl(\bigcup_{\alpha \in J} U_\alpha\Bigr) &= \bigcup_{\alpha \in J} p^{-1}(U_\alpha),\\
p^{-1}\Bigl(\bigcap_{i=1}^{n} U_i\Bigr) &= \bigcap_{i=1}^{n} p^{-1}(U_i).
\end{aligned}
\]
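
When $X$ is a finite space, the definition can be carried out by brute force: list every subset $U$ of $A$ and keep those whose inverse image is open in $X$. The following sketch does this for a hypothetical four-point space and surjection chosen here purely for illustration; the function names are assumptions and the enumeration is only practical for very small sets.

```python
from itertools import chain, combinations


def powerset(s):
    """Yield every subset of s as a frozenset."""
    items = list(s)
    subsets = chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))
    return (frozenset(c) for c in subsets)


def quotient_topology(open_sets, p):
    """Return the subsets U of the codomain of p whose preimage is open in X.

    open_sets: the topology of X, given as an iterable of sets.
    p: a surjective map, given as a dict from points of X to points of A.
    """
    opens = {frozenset(u) for u in open_sets}
    codomain = set(p.values())
    result = set()
    for u in powerset(codomain):
        pre = frozenset(x for x in p if p[x] in u)
        if pre in opens:
            result.add(u)
    return result


# Hypothetical finite space X = {0, 1, 2, 3} with a topology chosen by hand.
topology_on_x = [set(), {0}, {0, 1}, {2, 3}, {0, 2, 3}, {0, 1, 2, 3}]
# Hypothetical surjection onto A = {"a", "b", "c"}.
p = {0: "a", 1: "a", 2: "b", 3: "c"}

for u in sorted(quotient_topology(topology_on_x, p), key=lambda s: (len(s), sorted(s))):
    print(sorted(u))
```

In this example only $\varnothing$, $\{a\}$, $\{b, c\}$, and $A$ survive; for instance $\{b\}$ is not open because its inverse image $\{2\}$ is not open in $X$.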

EXAMPLE 3. Let $p$ be the map of the real line $\mathbb{R}$ onto the three-point set $A = \{a, b, c\}$
defined by

\[
p(x) = \begin{cases} a & \text{if } x > 0,\\ b & \text{if } x < 0,\\ c & \text{if } x = 0. \end{cases}
\]

You can check that the quotient topology on $A$ induced by $p$ is the one indicated in
Figure 22.3.


Figure 22.3 


There is a special situation in which the quotient topology occurs particularly
frequently. It is the following:

Definition. Let $X$ be a topological space, and let $X^*$ be a partition of $X$ into disjoint
subsets whose union is $X$. Let $p : X \to X^*$ be the surjective map that carries each
point of $X$ to the element of $X^*$ containing it. In the quotient topology induced by $p$,
the space $X^*$ is called a quotient space of $X$.


Given $X^*$, there is an equivalence relation on $X$ of which the elements of $X^*$ are
the equivalence classes. One can think of $X^*$ as having been obtained by "identifying"
each pair of equivalent points. For this reason, the quotient space $X^*$ is often called an
identification space, or a decomposition space, of the space $X$.

We can describe the topology of $X^*$ in another way. A subset $U$ of $X^*$ is a collection
of equivalence classes, and the set $p^{-1}(U)$ is just the union of the equivalence
classes belonging to $U$. Thus the typical open set of $X^*$ is a collection of equivalence
classes whose union is an open set of $X$.


EXAMPLE 4. Let $X$ be the closed unit ball

\[ \{x \times y \mid x^2 + y^2 \leq 1\} \]

in $\mathbb{R}^2$, and let $X^*$ be the partition of $X$ consisting of all the one-point sets $\{x \times y\}$ for
which $x^2 + y^2 < 1$, along with the set $S^1 = \{x \times y \mid x^2 + y^2 = 1\}$. Typical saturated
open sets in $X$ are pictured by the shaded regions in Figure 22.4. One can show that $X^*$ is
homeomorphic with the subspace of $\mathbb{R}^3$ called the unit 2-sphere, defined by

\[ S^2 = \{(x, y, z) \mid x^2 + y^2 + z^2 = 1\}. \]

Figure 22.4


EXAMPLE 5. Let $X$ be the rectangle $[0, 1] \times [0, 1]$. Define a partition $X^*$ of $X$ as follows:
It consists of all the one-point sets $\{x \times y\}$ where $0 < x < 1$ and $0 < y < 1$, the following
types of two-point sets:

\[
\begin{aligned}
&\{x \times 0,\ x \times 1\} && \text{where } 0 < x < 1,\\
&\{0 \times y,\ 1 \times y\} && \text{where } 0 < y < 1,
\end{aligned}
\]

and the four-point set

\[ \{0 \times 0,\ 0 \times 1,\ 1 \times 0,\ 1 \times 1\}. \]

Typical saturated open sets in $X$ are pictured by the shaded regions in Figure 22.5; each is
an open set of $X$ that equals a union of elements of $X^*$.

The image of each of these sets under $p$ is an open set of $X^*$, as indicated in
Figure 22.6. This description of $X^*$ is just the mathematical way of saying what we expressed
in pictures when we pasted the edges of a rectangle together to form a torus.


Figure 22.6 


Now we explore the relationship between the notions of quotient map and quo- 
tient space and the concepts introduced previously. It is interesting to note that this 
relationship is not as simple as one might wish. 

We have already noted that subspaces do not behave well; if $p : X \to Y$ is a
quotient map and $A$ is a subspace of $X$, then the map $q : A \to p(A)$ obtained by
restricting $p$ need not be a quotient map. One has, however, the following theorem:


Theorem 22.1. Let $p : X \to Y$ be a quotient map; let $A$ be a subspace of $X$ that is
saturated with respect to $p$; let $q : A \to p(A)$ be the map obtained by restricting $p$.
(1) If $A$ is either open or closed in $X$, then $q$ is a quotient map.
*(2) If $p$ is either an open map or a closed map, then $q$ is a quotient map.


Proof. Step 1. We verify first the following two equations:

\[
\begin{aligned}
q^{-1}(V) &= p^{-1}(V) && \text{if } V \subset p(A);\\
p(U \cap A) &= p(U) \cap p(A) && \text{if } U \subset X.
\end{aligned}
\]

To check the first equation, we note that since $V \subset p(A)$ and $A$ is saturated, $p^{-1}(V)$
is contained in $A$. It follows that both $p^{-1}(V)$ and $q^{-1}(V)$ equal all points of $A$ that
are mapped by $p$ into $V$. To check the second equation, we note that for any two
subsets $U$ and $A$ of $X$, we have the inclusion

\[ p(U \cap A) \subset p(U) \cap p(A). \]

To prove the reverse inclusion, suppose $y = p(u) = p(a)$, for $u \in U$ and $a \in A$.
Since $A$ is saturated, $A$ contains the set $p^{-1}(p(a))$, so that in particular $A$ contains $u$.
Then $y = p(u)$, where $u \in U \cap A$.

Step 2. Now suppose $A$ is open or $p$ is open. Given the subset $V$ of $p(A)$, we
assume that $q^{-1}(V)$ is open in $A$ and show that $V$ is open in $p(A)$.

Suppose first that $A$ is open. Since $q^{-1}(V)$ is open in $A$ and $A$ is open in $X$, the
set $q^{-1}(V)$ is open in $X$. Since $q^{-1}(V) = p^{-1}(V)$, the latter set is open in $X$, so that
$V$ is open in $Y$ because $p$ is a quotient map. In particular, $V$ is open in $p(A)$.

Now suppose $p$ is open. Since $q^{-1}(V) = p^{-1}(V)$ and $q^{-1}(V)$ is open in $A$, we
have $p^{-1}(V) = U \cap A$ for some set $U$ open in $X$. Now $p(p^{-1}(V)) = V$ because $p$ is
surjective; then

\[ V = p(p^{-1}(V)) = p(U \cap A) = p(U) \cap p(A). \]

The set $p(U)$ is open in $Y$ because $p$ is an open map; hence $V$ is open in $p(A)$.
Step 3. The proof when $A$ or $p$ is closed is obtained by replacing the word "open"
by the word "closed" throughout Step 2. ∎


Now we consider other concepts introduced previously. Composites of maps behave
nicely; it is easy to check that the composite of two quotient maps is a quotient
map; this fact follows from the equation

\[ p^{-1}(q^{-1}(U)) = (q \circ p)^{-1}(U). \]

On the other hand, products of maps do not behave well; the cartesian product of
two quotient maps need not be a quotient map. See Example 7 following. One needs
further conditions on either the maps or the spaces in order for this statement to be
true. One such, a condition on the spaces, is called local compactness; we shall study
it later. Another, a condition on the maps, is the condition that both the maps $p$ and $q$
be open maps. In that case, it is easy to see that $p \times q$ is also an open map, so it is a
quotient map.

Finally, the Hausdorff condition does not behave well; even if $X$ is Hausdorff,
there is no reason that the quotient space $X^*$ needs to be Hausdorff. There is a simple
condition for $X^*$ to satisfy the $T_1$ axiom; one simply requires that each element of the
partition $X^*$ be a closed subset of $X$. Conditions that will ensure $X^*$ is Hausdorff are
harder to find. This is one of the more delicate questions concerning quotient spaces;
we shall return to it several times later in the book.

Perhaps the most important result in the study of quotient spaces has to do with the
problem of constructing continuous functions on a quotient space. We consider that
problem now. When we studied product spaces, we had a criterion for determining
whether a map $f : Z \to \prod X_\alpha$ into a product space was continuous. Its counterpart in
the theory of quotient spaces is a criterion for determining when a map $f : X^* \to Z$
out of a quotient space is continuous. One has the following theorem:


Theorem 22.2. Let $p : X \to Y$ be a quotient map. Let $Z$ be a space and let
$g : X \to Z$ be a map that is constant on each set $p^{-1}(\{y\})$, for $y \in Y$. Then $g$ induces
a map $f : Y \to Z$ such that $f \circ p = g$. The induced map $f$ is continuous if and only
if $g$ is continuous; $f$ is a quotient map if and only if $g$ is a quotient map.

[Diagram: commutative triangle with $p : X \to Y$, $g : X \to Z$, and $f : Y \to Z$.]


Proof. For each $y \in Y$, the set $g(p^{-1}(\{y\}))$ is a one-point set in $Z$ (since $g$ is constant
on $p^{-1}(\{y\})$). If we let $f(y)$ denote this point, then we have defined a map $f : Y \to Z$
such that for each $x \in X$, $f(p(x)) = g(x)$. If $f$ is continuous, then $g = f \circ p$ is
continuous. Conversely, suppose $g$ is continuous. Given an open set $V$ of $Z$, $g^{-1}(V)$
is open in $X$. But $g^{-1}(V) = p^{-1}(f^{-1}(V))$; because $p$ is a quotient map, it follows
that $f^{-1}(V)$ is open in $Y$. Hence $f$ is continuous.

If $f$ is a quotient map, then $g$ is the composite of two quotient maps and is thus a
quotient map. Conversely, suppose that $g$ is a quotient map. Since $g$ is surjective, so
is $f$. Let $V$ be a subset of $Z$; we show that $V$ is open in $Z$ if $f^{-1}(V)$ is open in $Y$.
Now the set $p^{-1}(f^{-1}(V))$ is open in $X$ because $p$ is continuous. Since this set equals
$g^{-1}(V)$, the latter is open in $X$. Then because $g$ is a quotient map, $V$ is open in $Z$. ∎
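
The set-theoretic half of this proof, the construction of $f$ from $g$, is easy to mimic for finite sets. The sketch below is only an illustration under assumptions made here: maps are stored as dictionaries, the four-point example is hypothetical, and topology plays no role. It builds $f$ by setting $f(p(x)) = g(x)$ and reports an error when $g$ fails to be constant on some set $p^{-1}(\{y\})$, in which case no such $f$ exists.

```python
def induced_map(p, g):
    """Return the map f with f(p(x)) = g(x) for every x in the domain of p.

    p and g are dicts defined on the same finite set X. A ValueError is raised
    when g takes two different values on a single fiber of p.
    """
    f = {}
    for x, y in p.items():
        if y in f and f[y] != g[x]:
            raise ValueError(f"g is not constant on the fiber over {y!r}")
        f[y] = g[x]
    return f


# Hypothetical example: p identifies 0 with 1, and g respects that identification.
p = {0: "a", 1: "a", 2: "b", 3: "c"}
g = {0: 10, 1: 10, 2: 20, 3: 20}

f = induced_map(p, g)
print(f)
print(all(f[p[x]] == g[x] for x in p))  # checks the identity f o p = g
```

Note that $f$ need not be injective even when it is well defined; here the points $b$ and $c$ of $Y$ have the same image.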


Corollary 22.3. Let $g : X \to Z$ be a surjective continuous map. Let $X^*$ be the
following collection of subsets of $X$:

\[ X^* = \{g^{-1}(\{z\}) \mid z \in Z\}. \]

Give $X^*$ the quotient topology.
(a) The map $g$ induces a bijective continuous map $f : X^* \to Z$, which is a
homeomorphism if and only if $g$ is a quotient map.

(b) If $Z$ is Hausdorff, so is $X^*$.


Proof. By the preceding theorem, $g$ induces a continuous map $f : X^* \to Z$; it is
clear that $f$ is bijective. Suppose that $f$ is a homeomorphism. Then both $f$ and the
projection map $p : X \to X^*$ are quotient maps, so that their composite $g$ is a quotient
map. Conversely, suppose that $g$ is a quotient map. Then it follows from the preceding
theorem that $f$ is a quotient map. Being bijective, $f$ is thus a homeomorphism.
Suppose $Z$ is Hausdorff. Given distinct points of $X^*$, their images under $f$ are
distinct and thus possess disjoint neighborhoods $U$ and $V$. Then $f^{-1}(U)$ and $f^{-1}(V)$
are disjoint neighborhoods of the two given points of $X^*$. ∎


EXAMPLE 6. Let $X$ be the subspace of $\mathbb{R}^2$ that is the union of the line segments $[0, 1] \times \{n\}$,
for $n \in \mathbb{Z}_+$, and let $Z$ be the subspace of $\mathbb{R}^2$ consisting of all points of the form
$x \times (x/n)$ for $x \in [0, 1]$ and $n \in \mathbb{Z}_+$. Then $X$ is the union of countably many disjoint
line segments, and $Z$ is the union of countably many line segments having an end point in
common. See Figure 22.7.

Define a map $g : X \to Z$ by the equation $g(x \times n) = x \times (x/n)$; then $g$ is surjective
and continuous. The quotient space $X^*$ whose elements are the sets $g^{-1}(\{z\})$ is simply the
space obtained from $X$ by identifying the subset $\{0\} \times \mathbb{Z}_+$ to a point. The map $g$ induces a
bijective continuous map $f : X^* \to Z$. But $f$ is not a homeomorphism.

To verify this fact, it suffices to show that $g$ is not a quotient map. Consider the
sequence of points $x_n = (1/n) \times n$ of $X$. The set $A = \{x_n\}$ is a closed subset of $X$ because
it has no limit points. Also, it is saturated with respect to $g$. On the other hand, the set $g(A)$
is not closed in $Z$, for it consists of the points $z_n = (1/n) \times (1/n^2)$; this set has the origin
as a limit point.


Figure 22.7 


EXAMPLE 7. The product of two quotient maps need not be a quotient map.

We give an example that involves non-Hausdorff spaces in the exercises. Here is
another involving spaces that are nicer.

Let $X = \mathbb{R}$ and let $X^*$ be the quotient space obtained from $X$ by identifying the
subset $\mathbb{Z}_+$ to a point $b$; let $p : X \to X^*$ be the quotient map. Let $\mathbb{Q}$ be the subspace of $\mathbb{R}$
consisting of the rational numbers; let $i : \mathbb{Q} \to \mathbb{Q}$ be the identity map. We show that

\[ p \times i : X \times \mathbb{Q} \to X^* \times \mathbb{Q} \]

is not a quotient map.

For each $n$, let $c_n = \sqrt{2}/n$, and consider the straight lines in $\mathbb{R}^2$ with slopes $1$ and $-1$,
respectively, through the point $n \times c_n$. Let $U_n$ consist of all points of $X \times \mathbb{Q}$ that lie above
both of these lines or beneath both of them, and also between the vertical lines $x = n - 1/4$
and $x = n + 1/4$. Then $U_n$ is open in $X \times \mathbb{Q}$; it contains the set $\{n\} \times \mathbb{Q}$ because $c_n$ is not
rational. See Figure 22.8.

Let $U$ be the union of the sets $U_n$; then $U$ is open in $X \times \mathbb{Q}$. It is saturated with respect
to $p \times i$ because it contains the entire set $\mathbb{Z}_+ \times \{q\}$ for each $q \in \mathbb{Q}$. We assume that
$U' = (p \times i)(U)$ is open in $X^* \times \mathbb{Q}$ and derive a contradiction.

Because $U$ contains, in particular, the set $\mathbb{Z}_+ \times 0$, the set $U'$ contains the point $b \times 0$.
Hence $U'$ contains an open set of the form $W \times I_\delta$, where $W$ is a neighborhood of $b$ in $X^*$
and $I_\delta$ consists of all rational numbers $y$ with $|y| < \delta$. Then

\[ p^{-1}(W) \times I_\delta \subset U. \]

Choose $n$ large enough that $c_n < \delta$. Then since $p^{-1}(W)$ is open in $X$ and contains $\mathbb{Z}_+$,
we can choose $\epsilon < 1/4$ so that the interval $(n - \epsilon, n + \epsilon)$ is contained in $p^{-1}(W)$. Then
$U$ contains the subset $V = (n - \epsilon, n + \epsilon) \times I_\delta$ of $X \times \mathbb{Q}$. But the figure makes clear that
there are many points $x \times y$ of $V$ that do not lie in $U$! (One such is the point $x \times y$, where
$x = n + \tfrac{1}{2}\epsilon$ and $y$ is a rational number with $|y - c_n| < \tfrac{1}{2}\epsilon$.)


Figure 22.8 


Exercises 


1. Check the details of Example 3. 

2. (a) Let $p : X \to Y$ be a continuous map. Show that if there is a continuous map
$f : Y \to X$ such that $p \circ f$ equals the identity map of $Y$, then $p$ is a quotient
map.

(b) If $A \subset X$, a retraction of $X$ onto $A$ is a continuous map $r : X \to A$ such
that $r(a) = a$ for each $a \in A$. Show that a retraction is a quotient map.

3. Let $\pi_1 : \mathbb{R} \times \mathbb{R} \to \mathbb{R}$ be projection on the first coordinate. Let $A$ be the subspace
of $\mathbb{R} \times \mathbb{R}$ consisting of all points $x \times y$ for which either $x \geq 0$ or $y = 0$ (or both);
let $q : A \to \mathbb{R}$ be obtained by restricting $\pi_1$. Show that $q$ is a quotient map that
is neither open nor closed.


4. (a) Define an equivalence relation on the plane $X = \mathbb{R}^2$ as follows:

\[ x_0 \times y_0 \sim x_1 \times y_1 \quad \text{if } x_0 + y_0^2 = x_1 + y_1^2. \]

Let $X^*$ be the corresponding quotient space. It is homeomorphic to a familiar
space; what is it? [Hint: Set $g(x \times y) = x + y^2$.]
(b) Repeat (a) for the equivalence relation

\[ x_0 \times y_0 \sim x_1 \times y_1 \quad \text{if } x_0^2 + y_0^2 = x_1^2 + y_1^2. \]


5. Let $p : X \to Y$ be an open map. Show that if $A$ is open in $X$, then the map
$q : A \to p(A)$ obtained by restricting $p$ is an open map.

6. Recall that $\mathbb{R}_K$ denotes the real line in the $K$-topology. (See §13.) Let $Y$ be
the quotient space obtained from $\mathbb{R}_K$ by collapsing the set $K$ to a point; let
$p : \mathbb{R}_K \to Y$ be the quotient map.
(a) Show that $Y$ satisfies the $T_1$ axiom, but is not Hausdorff.
(b) Show that $p \times p : \mathbb{R}_K \times \mathbb{R}_K \to Y \times Y$ is not a quotient map. [Hint: The
diagonal is not closed in $Y \times Y$, but its inverse image is closed in $\mathbb{R}_K \times \mathbb{R}_K$.]


*Supplementary Exercises: Topological Groups 


In these exercises we consider topological groups and some of their properties. The 
quotient topology gets its name from the special case that arises when one forms the 
quotient of a topological group by a subgroup. 

A topological group $G$ is a group that is also a topological space satisfying the
$T_1$ axiom, such that the map of $G \times G$ into $G$ sending $x \times y$ into $x \cdot y$, and the
map of $G$ into $G$ sending $x$ into $x^{-1}$, are continuous maps. Throughout the following
exercises, let $G$ denote a topological group.

1. Let $H$ denote a group that is also a topological space satisfying the $T_1$ axiom.
Show that $H$ is a topological group if and only if the map of $H \times H$ into $H$
sending $x \times y$ into $x \cdot y^{-1}$ is continuous.

2. Show that the following are topological groups:

(a) $(\mathbb{Z}, +)$

(b) $(\mathbb{R}, +)$

(c) $(\mathbb{R}_+, \cdot)$

(d) $(S^1, \cdot)$, where we take $S^1$ to be the space of all complex numbers $z$ for which
$|z| = 1$.

(e) The general linear group $GL(n)$, under the operation of matrix multiplication.
($GL(n)$ is the set of all nonsingular $n$ by $n$ matrices, topologized by
considering it as a subset of euclidean space of dimension $n^2$ in the obvious
way.)

3. Let $H$ be a subspace of $G$. Show that if $H$ is also a subgroup of $G$, then both $H$
and $\bar{H}$ are topological groups.


4. Let $\alpha$ be an element of $G$. Show that the maps $f_\alpha, g_\alpha : G \to G$ defined by

\[ f_\alpha(x) = \alpha \cdot x \quad \text{and} \quad g_\alpha(x) = x \cdot \alpha \]

are homeomorphisms of $G$. Conclude that $G$ is a homogeneous space. (This
means that for every pair $x$, $y$ of points of $G$, there exists a homeomorphism
of $G$ onto itself that carries $x$ to $y$.)

5. Let $H$ be a subgroup of $G$. If $x \in G$, define $xH = \{x \cdot h \mid h \in H\}$; this set is
called a left coset of $H$ in $G$. Let $G/H$ denote the collection of left cosets of $H$
in $G$; it is a partition of $G$. Give $G/H$ the quotient topology.

(a) Show that if $\alpha \in G$, the map $f_\alpha$ of the preceding exercise induces a
homeomorphism of $G/H$ carrying $xH$ to $(\alpha \cdot x)H$. Conclude that $G/H$ is a
homogeneous space.

(b) Show that if $H$ is a closed set in the topology of $G$, then one-point sets are
closed in $G/H$.

(c) Show that the quotient map $p : G \to G/H$ is open.

(d) Show that if $H$ is closed in the topology of $G$ and is a normal subgroup of $G$,
then $G/H$ is a topological group.

6. The integers $\mathbb{Z}$ are a normal subgroup of $(\mathbb{R}, +)$. The quotient $\mathbb{R}/\mathbb{Z}$ is a familiar
topological group; what is it?

7. If A and B are subsets of G, let A · B denote the set of all points a · b for a ∈ A
and b ∈ B. Let A^{-1} denote the set of all points a^{-1}, for a ∈ A.

(a) A neighborhood V of the identity element e is said to be symmetric if V =
V^{-1}. If U is a neighborhood of e, show there is a symmetric neighborhood
V of e such that V · V ⊂ U. [Hint: If W is a neighborhood of e, then
W · W^{-1} is symmetric.]

(b) Show that G is Hausdorff. In fact, show that if x ≠ y, there is a
neighborhood V of e such that V · x and V · y are disjoint.

(c) Show that G satisfies the following separation axiom, which is called the
regularity axiom: Given a closed set A and a point x not in A, there exist
disjoint open sets containing A and x, respectively. [Hint: There is a
neighborhood V of e such that V · x and V · A are disjoint.]

(d) Let H be a subgroup of G that is closed in the topology of G; let p : G →
G/H be the quotient map. Show that G/H satisfies the regularity axiom.
[Hint: Examine the proof of (c) when A is saturated.]



Chapter 3 


Connectedness 
and Compactness 


In the study of calculus, there are three basic theorems about continuous functions, 
and on these theorems the rest of calculus depends. They are the following: 

Intermediate value theorem. If f : [a, b] → R is continuous and if r is a real
number between f(a) and f(b), then there exists an element c ∈ [a, b] such that
f(c) = r.

Maximum value theorem. If f : [a, b] → R is continuous, then there exists an
element c ∈ [a, b] such that f(x) ≤ f(c) for every x ∈ [a, b].

Uniform continuity theorem. If f : [a, b] → R is continuous, then given ε > 0,
there exists δ > 0 such that |f(x_1) − f(x_2)| < ε for every pair of numbers x_1, x_2
of [a, b] for which |x_1 − x_2| < δ.

These theorems are used in a number of places. The intermediate value theorem is 
used for instance in constructing inverse functions, such as ∛x and arcsin x; and the
maximum value theorem is used for proving the mean value theorem for derivatives, 
upon which the two fundamental theorems of calculus depend. The uniform continuity 
theorem is used, among other things, for proving that every continuous function is 
integrable. 

We have spoken of these three theorems as theorems about continuous functions. 
But they can also be considered as theorems about the closed interval [a, b] of real 
numbers. The theorems depend not only on the continuity of f but also on properties 
of the topological space [a, b].

The property of the space [a, b] on which the intermediate value theorem depends
is the property called connectedness, and the property on which the other two depend
is the property called compactness. In this chapter, we shall define these properties for 
arbitrary topological spaces, and shall prove the appropriate generalized versions of 
these theorems. 

As the three quoted theorems are fundamental for the theory of calculus, so are the 
notions of connectedness and compactness fundamental in higher analysis, geometry, 
and topology—indeed, in almost any subject for which the notion of topological space 
itself is relevant. 


§23 Connected Spaces 


The definition of connectedness for a topological space is a quite natural one. One says 
that a space can be “separated” if it can be broken up into two “globs”—disjoint open 
sets. Otherwise, one says that it is connected. From this simple idea much follows. 


Definition. Let X be a topological space. A separation of X is a pair U, V of disjoint
nonempty open subsets of X whose union is X. The space X is said to be connected
if there does not exist a separation of X.


Connectedness is obviously a topological property, since it is formulated entirely 
in terms of the collection of open sets of X. Said differently, if X is connected, so is 
any space homeomorphic to X. 

Another way of formulating the definition of connectedness is the following: 


A space X is connected if and only if the only subsets of X that are both 
open and closed in X are the empty set and X itself. 


For if A is a nonempty proper subset of X that is both open and closed in X, then the 
sets U = A and V = X − A constitute a separation of X, for they are open, disjoint,
and nonempty, and their union is X. Conversely, if U and V form a separation of X, 
then U is nonempty and different from X, and it is both open and closed in X. 
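
For a finite space whose open sets are listed explicitly, the open-and-closed criterion just stated can be checked mechanically. The following is a minimal sketch under that assumption; the function names `clopen_subsets` and `is_connected` are illustrative and do not come from the text.

```python
def clopen_subsets(points, opens):
    """Return every subset of `points` that is both open and closed.

    `opens` is assumed to be the full list of open sets of a topology
    on the finite set `points`.
    """
    points = frozenset(points)
    opens = {frozenset(u) for u in opens}
    # A set is closed exactly when its complement is open.
    return {u for u in opens if points - u in opens}


def is_connected(points, opens):
    """Connected iff the only clopen subsets are the empty set and the whole space."""
    points = frozenset(points)
    return clopen_subsets(points, opens) <= {frozenset(), points}


indiscrete = [set(), {0, 1}]             # two-point indiscrete space
discrete = [set(), {0}, {1}, {0, 1}]     # two-point discrete space
print(is_connected({0, 1}, indiscrete))  # True
print(is_connected({0, 1}, discrete))    # False
```

In the discrete case the sets {0} and {1} are both open and closed, so they form a separation; in the indiscrete case no nonempty proper subset is open at all.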

For a subspace Y of a topological space X, there is another useful way of formu- 
lating the definition of connectedness: 


Lemma 23.1. If Y is a subspace of X, a separation of Y is a pair of disjoint nonempty 
sets A and B whose union is Y, neither of which contains a limit point of the other. 
The space Y is connected if there exists no separation of Y. 


Proof. Suppose first that A and B form a separation of Y. Then A is both open and
closed in Y. The closure of A in Y is the set Ā ∩ Y (where Ā as usual denotes the
closure of A in X). Since A is closed in Y, A = Ā ∩ Y; or to say the same thing,
Ā ∩ B = ∅. Since Ā is the union of A and its limit points, B contains no limit points
of A. A similar argument shows that A contains no limit points of B.

Conversely, suppose that A and B are disjoint nonempty sets whose union is Y,
neither of which contains a limit point of the other. Then Ā ∩ B = ∅ and A ∩ B̄ = ∅;
therefore, we conclude that Ā ∩ Y = A and B̄ ∩ Y = B. Thus both A and B are closed
in Y, and since A = Y − B and B = Y − A, they are open in Y as well. ∎


EXAMPLE 1. Let X denote a two-point space in the indiscrete topology. Obviously there
is no separation of X, so X is connected. 


EXAMPLE 2. Let Y denote the subspace [−1, 0) ∪ (0, 1] of the real line R. Each of the
sets [−1, 0) and (0, 1] is nonempty and open in Y (although not in R); therefore, they form
a separation of Y. Alternatively, note that neither of these sets contains a limit point of the
other. (They do have a limit point 0 in common, but that does not matter.)


EXAMPLE 3. Let X be the subspace [−1, 1] of the real line. The sets [−1, 0] and (0, 1]
are disjoint and nonempty, but they do not form a separation of X, because the first set is
not open in X. Alternatively, note that the first set contains a limit point, 0, of the second.
Indeed, there exists no separation of the space [−1, 1]. We shall prove this fact shortly.


EXAMPLE 4. The rationals Q are not connected. Indeed, the only connected subspaces 
of Q are the one-point sets: If Y is a subspace of Q containing two points p and q, one can 
choose an irrational number a lying between p and q, and write Y as the union of the open
sets

Y ∩ (−∞, a) and Y ∩ (a, +∞).


EXAMPLE 5. Consider the following subset of the plane R^2:

X = {x × y | y = 0} ∪ {x × y | x > 0 and y = 1/x}.


Then X is not connected; indeed, the two indicated sets form a separation of X because 
neither contains a limit point of the other. See Figure 23.1. 


Figure 23.1 


We have given several examples of spaces that are not connected. How can one 
construct spaces that are connected? We shall now prove several theorems that tell 
how to form new connected spaces from given ones. In the next section we shall apply 
these theorems to show that some specific spaces, such as intervals in R, and balls and
cubes in R^n, are connected. First, a lemma:


Lemma 23.2. If the sets C and D form a separation of X, and if Y is a connected 
subspace of X, then Y lies entirely within either C or D. 


Proof. Since C and D are both open in X, the sets C ∩ Y and D ∩ Y are open in Y.
These two sets are disjoint and their union is Y; if they were both nonempty, they
would constitute a separation of Y. Therefore, one of them is empty. Hence Y must
lie entirely in C or in D. ∎


Theorem 23.3. The union of a collection of connected subspaces of X that have a
point in common is connected.


Proof. Let {A_α} be a collection of connected subspaces of a space X; let p be a point
of ⋂ A_α. We prove that the space Y = ⋃ A_α is connected. Suppose that Y = C ∪ D
is a separation of Y. The point p is in one of the sets C or D; suppose p ∈ C.
Since A_α is connected, it must lie entirely in either C or D, and it cannot lie in D
because it contains the point p of C. Hence A_α ⊂ C for every α, so that ⋃ A_α ⊂ C,
contradicting the fact that D is nonempty. ∎


Theorem 23.4. Let A be a connected subspace of X. If A ⊂ B ⊂ Ā, then B is also
connected.


Said differently: If B is formed by adjoining to the connected subspace A some or
all of its limit points, then B is connected.

Proof. Let A be connected and let A ⊂ B ⊂ Ā. Suppose that B = C ∪ D is a
separation of B. By Lemma 23.2, the set A must lie entirely in C or in D; suppose
that A ⊂ C. Then Ā ⊂ C̄; since C̄ and D are disjoint, B cannot intersect D. This
contradicts the fact that D is a nonempty subset of B. ∎


Theorem 23.5. The image of a connected space under a continuous map is connected.


Proof. Let f : X → Y be a continuous map; let X be connected. We wish to
prove the image space Z = f(X) is connected. Since the map obtained from f by
restricting its range to the space Z is also continuous, it suffices to consider the case
of a continuous surjective map

g : X → Z.

Suppose that Z = A ∪ B is a separation of Z into two disjoint nonempty sets open
in Z. Then g^{-1}(A) and g^{-1}(B) are disjoint sets whose union is X; they are open in X
because g is continuous, and nonempty because g is surjective. Therefore, they form
a separation of X, contradicting the assumption that X is connected. ∎


Theorem 23.6. A finite cartesian product of connected spaces is connected. 


Proof. We prove the theorem first for the product of two connected spaces X and Y.
This proof is easy to visualize. Choose a “base point” a × b in the product X × Y.
Note that the “horizontal slice” X × b is connected, being homeomorphic with X, and
each “vertical slice” x × Y is connected, being homeomorphic with Y. As a result,
each “T-shaped” space

T_x = (X × b) ∪ (x × Y)

is connected, being the union of two connected spaces that have the point x × b in
common. See Figure 23.2. Now form the union ⋃_{x∈X} T_x of all these T-shaped spaces.
This union is connected because it is the union of a collection of connected spaces that
have the point a × b in common. Since this union equals X × Y, the space X × Y is
connected.


Figure 23.2


The proof for any finite product of connected spaces follows by induction, using
the fact (easily proved) that X_1 × ··· × X_n is homeomorphic with (X_1 × ··· × X_{n−1}) ×
X_n. ∎


It is natural to ask whether this theorem extends to arbitrary products of connected 
spaces. The answer depends on which topology is used for the product, as the follow- 
ing examples show. 


EXAMPLE 6. Consider the cartesian product R^ω in the box topology. We can write R^ω
as the union of the set A consisting of all bounded sequences of real numbers, and the set B
of all unbounded sequences. These sets are disjoint, and each is open in the box topology.
For if a is a point of R^ω, the open set

U = (a_1 − 1, a_1 + 1) × (a_2 − 1, a_2 + 1) × ···

consists entirely of bounded sequences if a is bounded, and of unbounded sequences if a is
unbounded. Thus, even though R is connected (as we shall prove in the next section), R^ω
is not connected in the box topology.


EXAMPLE 7. Now consider R^ω in the product topology. Assuming that R is connected,
we show that R^ω is connected. Let R̃^n denote the subspace of R^ω consisting of
all sequences x = (x_1, x_2, ...) such that x_i = 0 for i > n. The space R̃^n is clearly
homeomorphic to R^n, so that it is connected, by the preceding theorem. It follows that the
space R^∞ that is the union of the spaces R̃^n is connected, for these spaces have the point
0 = (0, 0, ...) in common. We show that the closure of R^∞ equals all of R^ω, from which
it follows that R^ω is connected as well.

Let a = (a_1, a_2, ...) be a point of R^ω. Let U = ∏ U_i be a basis element for the
product topology that contains a. We show that U intersects R^∞. There is an integer N
such that U_i = R for i > N. Then the point

x = (a_1, ..., a_N, 0, 0, ...)

of R^∞ belongs to U, since a_i ∈ U_i for all i, and 0 ∈ U_i for i > N.




The argument just given generalizes to show that an arbitrary product of connected
spaces is connected in the product topology. Since we shall not need this result, we
leave the proof to the exercises.


Exercises 


1. Let T and T′ be two topologies on X. If T′ ⊃ T, what does connectedness
of X in one topology imply about connectedness in the other?

2. Let {A_n} be a sequence of connected subspaces of X, such that A_n ∩ A_{n+1} ≠ ∅
for all n. Show that ⋃ A_n is connected.

3. Let {A_α} be a collection of connected subspaces of X; let A be a connected
subspace of X. Show that if A ∩ A_α ≠ ∅ for all α, then A ∪ (⋃ A_α) is connected.

4. Show that if X is an infinite set, it is connected in the finite complement topology.

5. A space is totally disconnected if its only connected subspaces are one-point
sets. Show that if X has the discrete topology, then X is totally disconnected.
Does the converse hold?

6. Let A ⊂ X. Show that if C is a connected subspace of X that intersects both A
and X − A, then C intersects Bd A.

7. Is the space R_ℓ connected? Justify your answer.

8. Determine whether or not R^ω is connected in the uniform topology.

9. Let A be a proper subset of X, and let B be a proper subset of Y. If X and Y are
connected, show that

(X × Y) − (A × B)

is connected.

10. Let {X_α}_{α∈J} be an indexed family of connected spaces; let X be the product
space

X = ∏_{α∈J} X_α.

Let a = (a_α) be a fixed point of X.

(a) Given any finite subset K of J, let X_K denote the subspace of X consisting
of all points x = (x_α) such that x_α = a_α for α ∉ K. Show that X_K is
connected.

(b) Show that the union Y of the spaces X_K is connected.

(c) Show that X equals the closure of Y; conclude that X is connected.

11. Let p : X → Y be a quotient map. Show that if each set p^{-1}({y}) is connected,
and if Y is connected, then X is connected.

12. Let Y ⊂ X; let X and Y be connected. Show that if A and B form a separation
of X − Y, then Y ∪ A and Y ∪ B are connected.


§24 Connected Subspaces of the Real Line


The theorems of the preceding section show us how to construct new connected spaces 
out of given ones. But where can we find some connected spaces to start with? The 
best place to begin is the real line. We shall prove that R is connected, and so are the 
intervals and rays in R. 

One application is the intermediate value theorem of calculus, suitably general- 
ized. Another is the result that such familiar spaces as balls and spheres in euclidean 
space are connected; the proof involves a new notion, called path connectedness, 
which we also discuss. 

The fact that intervals and rays in R are connected may be familiar to you from 
analysis. We prove it again here, in generalized form. It turns out that this fact does 
not depend on the algebraic properties of R, but only on its order properties. To make 
this clear, we shall prove the theorem for an arbitrary ordered set that has the order 
properties of R. Such a set is called a linear continuum. 


Definition. A simply ordered set L having more than one element is called a linear 
continuum if the following hold: 

(1) L has the least upper bound property. 

(2) If x < y, there exists z such that x < z < y.


Theorem 24.1. If L is a linear continuum in the order topology, then L is connected, 
and so are intervals and rays in L. 


Proof. Recall that a subspace Y of L is said to be convex if for every pair of points
a, b of Y with a < b, the entire interval [a, b] of points of L lies in Y. We prove that
if Y is a convex subspace of L, then Y is connected.

So suppose that Y is the union of the disjoint nonempty sets A and B, each of
which is open in Y. Choose a ∈ A and b ∈ B; suppose for convenience that a < b.
The interval [a, b] of points of L is contained in Y. Hence [a, b] is the union of the
disjoint sets

A_0 = A ∩ [a, b] and B_0 = B ∩ [a, b],

each of which is open in [a, b] in the subspace topology, which is the same as the order
topology. The sets A_0 and B_0 are nonempty because a ∈ A_0 and b ∈ B_0. Thus, A_0
and B_0 constitute a separation of [a, b].

Let c = sup A_0. We show that c belongs neither to A_0 nor to B_0, which contradicts
the fact that [a, b] is the union of A_0 and B_0.


Case 1. Suppose that c ∈ B_0. Then c ≠ a, so either c = b or a < c < b. In
either case, it follows from the fact that B_0 is open in [a, b] that there is some interval
of the form (d, c] contained in B_0. If c = b, we have a contradiction at once, for d is a
smaller upper bound on A_0 than c. If c < b, we note that (c, b] does not intersect A_0
(because c is an upper bound on A_0). Then

(d, b] = (d, c] ∪ (c, b]

does not intersect A_0. Again, d is a smaller upper bound on A_0 than c, contrary to
construction. See Figure 24.1.


Figure 24.1 Figure 24.2


Case 2. Suppose that c ∈ A_0. Then c ≠ b, so either c = a or a < c < b.
Because A_0 is open in [a, b], there must be some interval of the form [c, e) contained
in A_0. See Figure 24.2. Because of order property (2) of the linear continuum L, we
can choose a point z of L such that c < z < e. Then z ∈ A_0, contrary to the fact that
c is an upper bound for A_0. ∎


Corollary 24.2. The real line R is connected and so are intervals and rays in R. 


As an application, we prove the intermediate value theorem of calculus, suitably 
generalized. 


Theorem 24.3 (Intermediate value theorem). Let f : X → Y be a continuous
map, where X is a connected space and Y is an ordered set in the order topology. If a
and b are two points of X and if r is a point of Y lying between f(a) and f(b), then
there exists a point c of X such that f(c) = r.


The intermediate value theorem of calculus is the special case of this theorem that 
occurs when we take X to be a closed interval in R and Y to be R. 


Proof. Assume the hypotheses of the theorem. The sets

A = f(X) ∩ (−∞, r) and B = f(X) ∩ (r, +∞)

are disjoint, and they are nonempty because one contains f(a) and the other
contains f(b). Each is open in f(X), being the intersection of an open ray in Y with f(X).
If there were no point c of X such that f(c) = r, then f(X) would be the union of the
sets A and B. Then A and B would constitute a separation of f(X), contradicting the
fact that the image of a connected space under a continuous map is connected. ∎
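
The theorem asserts only that a point c exists. In the calculus case (X a closed interval [a, b] in R and Y = R), such a point can be approximated by repeated bisection. The sketch below is illustrative only: the name `bisect_ivt`, the tolerance, and the iteration cap are assumptions, and the routine is not part of the proof above.

```python
def bisect_ivt(f, a, b, r, tol=1e-12, max_iter=200):
    """Approximate c in [a, b] with f(c) = r, for continuous f and a < b.

    Requires r to lie between f(a) and f(b), as in the theorem.
    """
    lo, hi = a, b
    if (f(lo) - r) * (f(hi) - r) > 0:
        raise ValueError("r must lie between f(a) and f(b)")
    for _ in range(max_iter):
        if hi - lo <= tol:
            break
        mid = (lo + hi) / 2
        # Invariant: r lies between f(lo) and f(hi), so by the
        # intermediate value theorem [lo, hi] contains a solution.
        if (f(lo) - r) * (f(mid) - r) <= 0:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2


# The cube root of 5, obtained as a solution of x**3 = 5 on [0, 2].
c = bisect_ivt(lambda x: x ** 3, 0.0, 2.0, 5.0)
print(abs(c ** 3 - 5.0) < 1e-9)  # True
```

Each step halves the interval while preserving the hypothesis of the theorem on the smaller interval, which is why a solution is never lost.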
EXAMPLE 1. One example of a linear continuum different from R is the ordered square.
We check the least upper bound property. (The second property of a linear continuum is
trivial to check.) Let A be a subset of I × I; let π_1 : I × I → I be projection on the first
coordinate; let b = sup π_1(A). If b ∈ π_1(A), then A intersects the subset b × I of I × I.
Because b × I has the order type of I, the set A ∩ (b × I) will have a least upper bound
b × c, which will be the least upper bound of A. See Figure 24.3. If b ∉ π_1(A), then b × 0
is the least upper bound of A; no element of the form b′ × c with b′ < b can be an upper
bound for A, for then b′ would be an upper bound for π_1(A).


Figure 24.3


EXAMPLE 2. If X is a well-ordered set, then X × [0, 1) is a linear continuum in the
dictionary order; this we leave to you to check. This set can be thought of as having been
constructed by “fitting in” a set of the order type of (0, 1) immediately following each
element of X.


Connectedness of intervals in R gives rise to an especially useful criterion for 
showing that a space X is connected; namely, the condition that every pair of points 
of X can be joined by a path in X: 


Definition. Given points x and y of the space X, a path in X from x to y is a
continuous map f : [a, b] → X of some closed interval in the real line into X, such
that f(a) = x and f(b) = y. A space X is said to be path connected if every pair of
points of X can be joined by a path in X.


It is easy to see that a path-connected space X is connected. Suppose X = A ∪ B
is a separation of X. Let f : [a, b] → X be any path in X. Being the continuous
image of a connected set, the set f([a, b]) is connected, so that it lies entirely in either
A or B. Therefore, there is no path in X joining a point of A to a point of B, contrary
to the assumption that X is path connected.

The converse does not hold; a connected space need not be path connected. See
Examples 6 and 7 following.


EXAMPLE 3. Define the unit ball B^n in R^n by the equation

B^n = {x | ‖x‖ ≤ 1},

where

‖x‖ = ‖(x_1, ..., x_n)‖ = (x_1^2 + ··· + x_n^2)^{1/2}.

The unit ball is path connected; given any two points x and y of B^n, the straight-line path
f : [0, 1] → R^n defined by

f(t) = (1 − t)x + ty

lies in B^n. For if x and y are in B^n and t is in [0, 1],

‖f(t)‖ ≤ (1 − t)‖x‖ + t‖y‖ ≤ 1.

A similar argument shows that every open ball B_d(x, ε) and every closed ball B̄_d(x, ε)
in R^n is path connected.
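
The straight-line path of Example 3 is easy to evaluate numerically. The sketch below samples it and confirms that the sampled points stay in the unit ball; the helper names `norm` and `straight_line_path`, the sample points, and the rounding allowance are all illustrative assumptions. Sampling illustrates the inequality above but does not replace it.

```python
import math


def norm(x):
    """Euclidean norm of a point of R^n given as a tuple."""
    return math.sqrt(sum(c * c for c in x))


def straight_line_path(x, y):
    """Return f with f(t) = (1 - t) x + t y for t in [0, 1]."""
    def f(t):
        return tuple((1 - t) * a + t * b for a, b in zip(x, y))
    return f


x = (0.6, -0.8, 0.0)   # a point with norm 1 (up to rounding)
y = (0.0, 0.0, 1.0)    # another point of the unit ball
f = straight_line_path(x, y)
samples = [f(k / 100) for k in range(101)]
# Allow a tiny margin for floating-point rounding.
print(all(norm(p) <= 1 + 1e-12 for p in samples))  # True
```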


EXAMPLE 4. Define punctured euclidean space to be the space R^n − {0}, where 0 is
the origin in R^n. If n > 1, this space is path connected: Given x and y different from 0,
we can join x and y by the straight-line path between them if that path does not go through
the origin. Otherwise, we can choose a point z not on the line joining x and y, and take the
broken-line path from x to z, and then from z to y.
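
The choice made in Example 4 can be written out as a short procedure. The sketch below assumes points with exact (integer or rational) coordinates so that equality tests are reliable; the function names are illustrative, and picking z among the standard basis vectors is one convenient choice that the text does not prescribe.

```python
def parallel(u, v):
    """True if u and v are linearly dependent (all 2x2 minors vanish)."""
    n = len(u)
    return all(u[i] * v[j] == u[j] * v[i] for i in range(n) for j in range(n))


def segment_hits_origin(x, y):
    """True if the segment from x to y passes through the origin.

    Assumes x and y are nonzero; the segment then meets 0 exactly when
    y is a negative multiple of x.
    """
    return parallel(x, y) and sum(a * b for a, b in zip(x, y)) < 0


def path_avoiding_origin(x, y):
    """Vertices of a polygonal path from x to y in R^n - {0}, n > 1."""
    n = len(x)
    if n < 2:
        raise ValueError("the construction needs n > 1")
    if not segment_hits_origin(x, y):
        return [x, y]
    # Otherwise the line through x and y is the line spanned by x;
    # choose a standard basis vector z that is not on that line.
    for i in range(n):
        z = tuple(int(k == i) for k in range(n))
        if not parallel(x, z):
            return [x, z, y]
    raise ValueError("x must be nonzero")


print(path_avoiding_origin((1, 2), (3, 1)))   # [(1, 2), (3, 1)]
print(path_avoiding_origin((1, 0), (-2, 0)))  # [(1, 0), (0, 1), (-2, 0)]
```

When n = 1 no such z exists, which matches the fact that R − {0} is not connected.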


EXAMPLE 5. Define the unit sphere S^{n−1} in R^n by the equation

S^{n−1} = {x | ‖x‖ = 1}.

If n > 1, it is path connected. For the map g : R^n − {0} → S^{n−1} defined by g(x) = x/‖x‖
is continuous and surjective; and it is easy to show that the continuous image of a
path-connected space is path connected.


EXAMPLE 6. The ordered square I_o^2 is connected but not path connected.

Being a linear continuum, the ordered square is connected. Let p = 0 × 0 and q =
1 × 1. We suppose there is a path f : [a, b] → I_o^2 joining p and q and derive a contradiction.
The image set f([a, b]) must contain every point x × y of I_o^2, by the intermediate value
theorem. Therefore, for each x ∈ I, the set

U_x = f^{-1}(x × (0, 1))

is a nonempty subset of [a, b]; by continuity, it is open in [a, b]. See Figure 24.4. Choose,
for each x ∈ I, a rational number q_x belonging to U_x. Since the sets U_x are disjoint, the
map x → q_x is an injective mapping of I into Q. This contradicts the fact that the interval I
is uncountable (which we shall prove later).


Figure 24.4


EXAMPLE 7. Let S denote the following subset of the plane.

S = {x × sin(1/x) | 0 < x ≤ 1}.

Because S is the image of the connected set (0, 1] under a continuous map, S is connected.
Therefore, its closure S̄ in R^2 is also connected. The set S̄ is a classical example in topology
called the topologist’s sine curve. It is illustrated in Figure 24.5; it equals the union of S
and the vertical interval 0 × [−1, 1]. We show that S̄ is not path connected.

Suppose there is a path f : [a, c] → S̄ beginning at the origin and ending at a point
of S. The set of those t for which f(t) ∈ 0 × [−1, 1] is closed, so it has a largest element b.
Then f : [b, c] → S̄ is a path that maps b into the vertical interval 0 × [−1, 1] and maps
the other points of [b, c] to points of S.

Replace [b, c] by [0, 1] for convenience; let f(t) = (x(t), y(t)). Then x(0) = 0,
while x(t) > 0 and y(t) = sin(1/x(t)) for t > 0. We show there is a sequence of points
t_n → 0 such that y(t_n) = (−1)^n. Then the sequence y(t_n) does not converge, contradicting
continuity of f.

To find t_n, we proceed as follows: Given n, choose u with 0 < u < x(1/n) such that
sin(1/u) = (−1)^n. Then use the intermediate value theorem to find t_n with 0 < t_n < 1/n
such that x(t_n) = u.
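
The first step of this construction, choosing u, can be made explicit. Since sin(π/2 + kπ) = (−1)^k, it suffices to take 1/u = π/2 + kπ with k of the same parity as n and large enough. The sketch below does this; the function name `oscillation_point` and the parameter `bound` (standing in for the positive number x(1/n)) are illustrative assumptions.

```python
import math


def oscillation_point(n, bound):
    """Return u with 0 < u < bound and sin(1/u) = (-1)**n, up to rounding.

    `bound` must be positive; in the text it plays the role of x(1/n).
    """
    if bound <= 0:
        raise ValueError("bound must be positive")
    k = n % 2  # keep k of the same parity as n
    while 1 / (math.pi / 2 + k * math.pi) >= bound:
        k += 2
    return 1 / (math.pi / 2 + k * math.pi)


for n in range(1, 5):
    u = oscillation_point(n, 1 / n)
    print(n, 0 < u < 1 / n, round(math.sin(1 / u)))
```

The printed values of sin(1/u) alternate between −1 and 1 while u shrinks toward 0, which is the oscillation that prevents y(t_n) from converging.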


Figure 24.5


Exercises 


1. (a) Show that no two of the spaces (0, 1), (0, 1], and [0, 1] are homeomorphic.
[Hint: What happens if you remove a point from each of these spaces?]
(b) Suppose that there exist imbeddings f : X → Y and g : Y → X. Show by
means of an example that X and Y need not be homeomorphic.
(c) Show R^n and R are not homeomorphic if n > 1.

2. Let f : S^1 → R be a continuous map. Show there exists a point x of S^1 such
that f(x) = f(−x).

3. Let f : X → X be continuous. Show that if X = [0, 1], there is a point x such
that f(x) = x. The point x is called a fixed point of f. What happens if X
equals [0, 1) or (0, 1)?

4. Let X be an ordered set in the order topology. Show that if X is connected, then
X is a linear continuum.

5. Consider the following sets in the dictionary order. Which are linear continua?
(a) Z_+ × [0, 1)
(b) [0, 1) × Z_+
(c) (0, 1) × (0, 1]
(d) [0, 1) × (0, 1)

6. Show that if X is a well-ordered set, then X × [0, 1) in the dictionary order is a
linear continuum.

7. (a) Let X and Y be ordered sets in the order topology. Show that if f : X → Y
is order preserving and surjective, then f is a homeomorphism.

(b) Let X = Y = R̄_+. Given a positive integer n, show that the function f(x) =
x^n is order preserving and surjective. Conclude that its inverse, the nth root
function, is continuous.

(c) Let X be the subspace (−∞, −1) ∪ [0, ∞) of R. Show that the function
f : X → R defined by setting f(x) = x + 1 if x < −1, and f(x) = x if
x ≥ 0, is order preserving and surjective. Is f a homeomorphism? Compare
with (a).

8. (a) Is a product of path-connected spaces necessarily path connected?

(b) If A ⊂ X and A is path connected, is Ā necessarily path connected?

(c) If f : X → Y is continuous and X is path connected, is f(X) necessarily
path connected?

(d) If {A_α} is a collection of path-connected subspaces of X and if ⋂ A_α ≠ ∅,
is ⋃ A_α necessarily path connected?

9. Assume that R is uncountable. Show that if A is a countable subset of R^2, then
R^2 − A is path connected. [Hint: How many lines are there passing through a
given point of R^2?]

10. Show that if U is an open connected subspace of R^2, then U is path connected.
[Hint: Show that given x_0 ∈ U, the set of points that can be joined to x_0 by a
path in U is both open and closed in U.]

11. If A is a connected subspace of X, does it follow that Int A and Bd A are
connected? Does the converse hold? Justify your answers.

12. Recall that S_Ω denotes the minimal uncountable well-ordered set. Let L denote
the ordered set S_Ω × [0, 1) in the dictionary order, with its smallest element
deleted. The set L is a classical example in topology called the long line.

Theorem. The long line is path connected and locally homeomorphic to R, but
it cannot be imbedded in R.

(a) Let X be an ordered set; let a < b < c be points of X. Show that [a, c) has
the order type of [0, 1) if and only if both [a, b) and [b, c) have the order
type of [0, 1).

(b) Let X be an ordered set. Let x_0 < x_1 < ··· be an increasing sequence of
points of X; suppose b = sup{x_i}. Show that [x_0, b) has the order type of
[0, 1) if and only if each interval [x_i, x_{i+1}) has the order type of [0, 1).

(c) Let a_0 denote the smallest element of S_Ω. For each element a of S_Ω different
from a_0, show that the interval [a_0 × 0, a × 0) of S_Ω × [0, 1) has the order
type of [0, 1). [Hint: Proceed by transfinite induction. Either a has an
immediate predecessor in S_Ω, or there is an increasing sequence a_i in S_Ω
with a = sup{a_i}.]

(d) Show that L is path connected.

(e) Show that every point of L has a neighborhood homeomorphic with an open
interval in R.

(f) Show that L cannot be imbedded in R, or indeed in R^n for any n. [Hint:
Any subspace of R^n has a countable basis for its topology.]


*§25 Components and Local Connectedness†


Given an arbitrary space X, there is a natural way to break it up into pieces that are 
connected (or path connected). We consider that process now. 


Definition. Given X, define an equivalence relation on X by setting x ~ y if there 
is a connected subspace of X containing both x and y. The equivalence classes are 
called the components (or the “connected components”) of X. 


Symmetry and reflexivity of the relation are obvious. Transitivity follows by not- 
ing that if A is a connected subspace containing x and y, and if B is a connected 
subspace containing y and z, then A U B is a subspace containing x and z that is 
connected because A and B have the point y in common. 

The components of X can also be described as follows: 


Theorem 25.1. The components of X are connected disjoint subspaces of X whose 
union is X, such that each nonempty connected subspace of X intersects only one of 
them. 


Proof. Being equivalence classes, the components of X are disjoint and their union
is X. Each connected subspace A of X intersects only one of them. For if A intersects
the components C_1 and C_2 of X, say in points x_1 and x_2, respectively, then x_1 ~ x_2
by definition; this cannot happen unless C_1 = C_2.


†This section will be assumed in Part II of the book.

To show the component C is connected, choose a point x_0 of C. For each point x
of C, we know that x_0 ~ x, so there is a connected subspace A_x containing x_0 and x.
By the result just proved, A_x ⊂ C. Therefore,

C = ⋃_{x∈C} A_x.

Since the subspaces A_x are connected and have the point x_0 in common, their union is
connected. ∎
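
For a finite space with an explicit list of open sets, the components can be computed directly from the definition: the component of x is the union of all connected subspaces containing x. The brute-force sketch below assumes such a finite space; the function names are illustrative, and the search over all subsets is exponential, so it is only meant for very small examples.

```python
from itertools import combinations


def subspace_is_connected(subset, opens):
    """Check connectedness of `subset` in the subspace topology."""
    subset = frozenset(subset)
    # Open sets of the subspace are intersections with open sets of X.
    rel_opens = {frozenset(u) & subset for u in opens}
    clopen = {u for u in rel_opens if subset - u in rel_opens}
    return clopen <= {frozenset(), subset}


def components(points, opens):
    """Return the set of components of a finite topological space."""
    points = list(points)
    subsets = [frozenset(c)
               for r in range(1, len(points) + 1)
               for c in combinations(points, r)]
    connected = [s for s in subsets if subspace_is_connected(s, opens)]
    # The component of x is the union of the connected subspaces containing x
    # (the one-point set {x} is always among them).
    return {frozenset().union(*(s for s in connected if x in s))
            for x in points}


# Disjoint union of a two-point space with opens {}, {0}, {0, 1} and a point 2.
opens = [set(), {0}, {0, 1}, {2}, {0, 2}, {0, 1, 2}]
print(components({0, 1, 2}, opens) == {frozenset({0, 1}), frozenset({2})})  # True
```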


Definition. We define another equivalence relation on the space X by defining x ~ y 
if there is a path in X from x to y. The equivalence classes are called the path compo- 
nents of X. 


Let us show this is an equivalence relation. First we note that if there exists a path
f : [a, b] → X from x to y whose domain is the interval [a, b], then there is also
a path g from x to y having the closed interval [c, d] as its domain. (This follows
from the fact that any two closed intervals in R are homeomorphic.) Now the fact that
x ~ x for each x in X follows from the existence of the constant path f : [a, b] → X
defined by the equation f(t) = x for all t. Symmetry follows from the fact that if
f : [0, 1] → X is a path from x to y, then the “reverse path” g : [0, 1] → X defined
by g(t) = f(1 − t) is a path from y to x. Finally, transitivity is proved as follows: Let
f : [0, 1] → X be a path from x to y, and let g : [1, 2] → X be a path from y to z.
We can “paste f and g together” to get a path h : [0, 2] → X from x to z; the path h
will be continuous by the “pasting lemma,” Theorem 18.3.
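
The reverse path and the pasted path are simple to write down when paths are represented as functions of a real parameter. The sketch below assumes that representation; the names `reverse_path` and `paste` are illustrative, and continuity of the result is guaranteed by the pasting lemma rather than checked by the code.

```python
def reverse_path(f):
    """Given f : [0, 1] -> X from x to y, return g(t) = f(1 - t), from y to x."""
    return lambda t: f(1 - t)


def paste(f, g):
    """Given f on [0, 1] and g on [1, 2] with f(1) == g(1), return h on [0, 2]."""
    if f(1) != g(1):
        raise ValueError("the paths must agree at t = 1")
    return lambda t: f(t) if t <= 1 else g(t)


f = lambda t: (t, 0)       # path in the plane from (0, 0) to (1, 0)
g = lambda t: (1, t - 1)   # path on [1, 2] from (1, 0) to (1, 1)
h = paste(f, g)
print(h(0), h(1), h(2))    # (0, 0) (1, 0) (1, 1)
print(reverse_path(f)(0))  # (1, 0)
```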

One has the following theorem, whose proof is similar to that of the theorem pre- 
ceding: 


Theorem 25.2. The path components of X are path-connected disjoint subspaces 
of X whose union is X, such that each nonempty path-connected subspace of X inter- 
sects only one of them. 


Note that each component of a space X is closed in X, since the closure of a 
connected subspace of X is connected. If X has only finitely many components, then 
each component is also open in X, since its complement is a finite union of closed sets. 
But in general the components of X need not be open in X. 

One can say even less about the path components of X, for they need be neither 
open nor closed in X. Consider the following examples: 

EXAMPLE 1. If Q is the subspace of R consisting of the rational numbers, then each

component of Q consists of a single point. None of the components of Q are open in Q. 


EXAMPLE 2. The “topologist’s sine curve” S̄ of the preceding section is a space that has
a single component (since it is connected) and two path components. One path component
is the curve S and the other is the vertical interval V = 0 × [−1, 1]. Note that S is open
in S̄ but not closed, while V is closed but not open.

If one forms a space from S̄ by deleting all points of V having rational second
coordinate, one obtains a space that has only one component but uncountably many path
components.

Connectedness is a useful property for a space to possess. But for some purposes,
it is more important that the space satisfy a connectedness condition locally. Roughly 
speaking, local connectedness means that each point has “arbitrarily small” neighbor- 
hoods that are connected. More precisely, one has the following definition: 


Definition. A space X is said to be locally connected at x if for every neighbor- 
hood U of x, there is a connected neighborhood V of x contained in U. If X is locally 
connected at each of its points, it is said simply to be locally connected. Similarly, a 
space X is said to be locally path connected at x if for every neighborhood U of x, 
there is a path-connected neighborhood V of x contained in U. If X is locally path 
connected at each of its points, then it is said to be locally path connected. 


EXAMPLE 3. Each interval and each ray in the real line is both connected and locally 
connected. The subspace [−1, 0) ∪ (0, 1] of R is not connected, but it is locally connected.
The topologist’s sine curve is connected but not locally connected. The rationals Q are 
neither connected nor locally connected. 


Theorem 25.3. A space X is locally connected if and only if for every open set U 
of X, each component of U is open in X. 


Proof. Suppose that X is locally connected; let U be an open set in X; let C be a
component of U. If x is a point of C, we can choose a connected neighborhood V of x
such that V ⊂ U. Since V is connected, it must lie entirely in the component C of U.
Therefore, C is open in X. 

Conversely, suppose that components of open sets in X are open. Given a point x 
of X and a neighborhood U of x, let C be the component of U containing x. Now C 
is connected; since it is open in X by hypothesis, X is locally connected at x. ∎


A similar proof holds for the following theorem: 


Theorem 25.4. A space X is locally path connected if and only if for every open 
set U of X, each path component of U is open in X. 


The relation between path components and components is given in the following 
theorem: 


Theorem 25.5. If X is a topological space, each path component of X lies in a 
component of X. If X is locally path connected, then the components and the path 
components of X are the same. 


Proof. Let C be a component of X; let x be a point of C; let P be the path component
of X containing x. Since P is connected, P ⊂ C. We wish to show that if X is locally
path connected, P = C. Suppose that P ⊊ C. Let Q denote the union of all the path
components of X that are different from P and intersect C; each of them necessarily
lies in C, so that


C = P ∪ Q.


Because X is locally path connected, each path component of X is open in X. Therefore,
P (which is a path component) and Q (which is a union of path components)
are open in X, so they constitute a separation of C. This contradicts the fact that C is
connected. ∎


Exercises 


1. What are the components and path components of R_ℓ? What are the continuous
maps f : R → R_ℓ?
2. (a) What are the components and path components of R^ω (in the product
topology)?
(b) Consider R^ω in the uniform topology. Show that x and y lie in the same
component of R^ω if and only if the sequence


x − y = (x_1 − y_1, x_2 − y_2, ...)


is bounded. [Hint: It suffices to consider the case where y = 0.]

(c) Give R^ω the box topology. Show that x and y lie in the same component
of R^ω if and only if the sequence x − y is “eventually zero.” [Hint: If x − y is
not eventually zero, show there is a homeomorphism h of R^ω with itself such
that h(x) is bounded and h(y) is unbounded.]

3. Show that the ordered square is locally connected but not locally path connected.
What are the path components of this space?

4. Let X be locally path connected. Show that every connected open set in X is
path connected.
5. Let X denote the rational points of the interval [0, 1] × 0 of R². Let T denote the
union of all line segments joining the point p = 0 × 1 to points of X.

(a) Show that T is path connected, but is locally connected only at the point p.

(b) Find a subset of R² that is path connected but is locally connected at none
of its points.

6. A space X is said to be weakly locally connected at x if for every neighborhood
U of x, there is a connected subspace of X contained in U that contains
a neighborhood of x. Show that if X is weakly locally connected at each of its
points, then X is locally connected. [Hint: Show that components of open sets
are open.]
7. Consider the “infinite broom” X pictured in Figure 25.1. Show that X is not
locally connected at p, but is weakly locally connected at p. [Hint: Any connected
neighborhood of p must contain all the points a_i.]


Figure 25.1 


8. Let p : X → Y be a quotient map. Show that if X is locally connected, then Y
is locally connected. [Hint: If C is a component of the open set U of Y, show
that p⁻¹(C) is a union of components of p⁻¹(U).]

9. Let G be a topological group; let C be the component of G containing the identity
element e. Show that C is a normal subgroup of G. [Hint: If x ∈ G, then xC is
the component of G containing x.]

10. Let X be a space. Let us define x ~ y if there is no separation X = A ∪ B of X
into disjoint open sets such that x ∈ A and y ∈ B.

(a) Show this relation is an equivalence relation. The equivalence classes are
called the quasicomponents of X.

(b) Show that each component of X lies in a quasicomponent of X, and that
the components and quasicomponents of X are the same if X is locally
connected.

(c) Let K denote the set {1/n | n ∈ Z₊} and let −K denote the set {−1/n | n ∈ Z₊}.
Determine the components, path components, and quasicomponents of
the following subspaces of R²:


A = (K × [0, 1]) ∪ {0 × 0} ∪ {0 × 1}.
B = A ∪ ([0, 1] × {0}).
C = (K × [0, 1]) ∪ (−K × [−1, 0]) ∪ ([0, 1] × −K) ∪ ([−1, 0] × K).


§26 Compact Spaces 


The notion of compactness is not nearly so natural as that of connectedness. From the 
beginnings of topology, it was clear that the closed interval [a, b] of the real line had
a certain property that was crucial for proving such theorems as the maximum value 
theorem and the uniform continuity theorem. But for a long time, it was not clear 
how this property should be formulated for an arbitrary topological space. It used to 
be thought that the crucial property of [a, b] was the fact that every infinite subset 
of [a, b] has a limit point, and this property was the one dignified with the name of 
compactness. Later, mathematicians realized that this formulation does not lie at the 
heart of the matter, but rather that a stronger formulation, in terms of open coverings 
of the space, is more central. The latter formulation is what we now call compactness.
It is not as natural or intuitive as the former; some familiarity with it is needed before
its usefulness becomes apparent. 


Definition. A collection A of subsets of a space X is said to cover X, or to be a
covering of X, if the union of the elements of A is equal to X. It is called an open 
covering of X if its elements are open subsets of X. 


Definition. A space X is said to be compact if every open covering A of X contains 
a finite subcollection that also covers X. 


EXAMPLE 1. The real line R is not compact, for the covering of R by open intervals
A = {(n, n + 2) | n ∈ Z}


contains no finite subcollection that covers R. 


EXAMPLE 2. The following subspace of R is compact:
X = {0} ∪ {1/n | n ∈ Z₊}.


Given an open covering A of X, there is an element U of A containing 0. The set U 
contains all but finitely many of the points 1/n; choose, for each point of X not in U, an 
element of A containing it. The collection consisting of these elements of A, along with 
the element U, is a finite subcollection of A that covers X. 
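
The counting step in this example can be made concrete with a small sketch. Because U is open in X and contains 0, it contains every point of X in some interval (−ε, ε). The value of ε is an assumption of the illustration, as is the helper name `points_outside`. Only the points 1/n with 1/n ≥ ε can fail to lie in U, and there are finitely many of them.

```python
from fractions import Fraction
from math import floor


def points_outside(eps: Fraction) -> list:
    """List the points 1/n of X that do not lie in (-eps, eps).

    These are exactly the points with n <= 1/eps, so there are
    floor(1/eps) of them; every other point of X lies in the interval.
    """
    return [Fraction(1, n) for n in range(1, floor(1 / eps) + 1)]


# Hypothetical choice: U contains every point of X within 3/10 of 0.
leftover = points_outside(Fraction(3, 10))
```

One element of the covering per leftover point, together with U, gives the finite subcollection described above.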


EXAMPLE 3. Any space X containing only finitely many points is necessarily compact, 
because in this case every open covering of X is finite. 


EXAMPLE 4. The interval (0, 1] is not compact; the open covering
A = {(1/n, 1] | n ∈ Z₊}


contains no finite subcollection covering (0, 1]. Nor is the interval (0, 1) compact; the 
same argument applies. On the other hand, the interval [0, 1] is compact; you are probably
already familiar with this fact from analysis. In any case, we shall prove it shortly. 
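
To see why no finite subcollection of the covering {(1/n, 1]} can cover (0, 1], here is a short sketch that produces a missed point for any given finite set of indices. The helper names `uncovered_point` and `in_member` are illustrative assumptions. Exact rational arithmetic avoids rounding questions.

```python
from fractions import Fraction
from typing import Iterable


def in_member(p: Fraction, n: int) -> bool:
    """Membership test for the set (1/n, 1]."""
    return Fraction(1, n) < p <= 1


def uncovered_point(indices: Iterable[int]) -> Fraction:
    """Return a point of (0, 1] lying in none of the sets (1/n, 1], n in indices.

    If N is the largest index, then 1/(N + 1) <= 1/n for every n in indices,
    so 1/(N + 1) is missed by each of the chosen sets.
    """
    N = max(indices)
    return Fraction(1, N + 1)


indices = [2, 5, 17]            # a hypothetical finite subcollection
p = uncovered_point(indices)    # the point 1/18
```

The point returned does lie in a later element of the covering, which is consistent with the whole collection covering (0, 1].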


In general, it takes some effort to decide whether a given space is compact or 
not. First we shall prove some general theorems that show us how to construct new 
compact spaces out of existing ones. Then in the next section we shall show certain 
specific spaces are compact. These spaces include all closed intervals in the real line, 
and all closed and bounded subsets of Rⁿ.

Let us first prove some facts about subspaces. If Y is a subspace of X, a collec- 
tion A of subsets of X is said to cover Y if the union of its elements contains Y. 


Lemma 26.1. Let Y be a subspace of X. Then Y is compact if and only if every 
covering of Y by sets open in X contains a finite subcollection covering Y.

Proof. Suppose that Y is compact and A = {A_α}_{α∈J} is a covering of Y by sets open
in X. Then the collection


{A_α ∩ Y | α ∈ J}

is a covering of Y by sets open in Y; hence a finite subcollection

{A_{α_1} ∩ Y, ..., A_{α_n} ∩ Y}


covers Y. Then {A_{α_1}, ..., A_{α_n}} is a subcollection of A that covers Y.

Conversely, suppose the given condition holds; we wish to prove Y compact. Let
A′ = {A′_α} be a covering of Y by sets open in Y. For each α, choose a set A_α open
in X such that


A′_α = A_α ∩ Y.

The collection A = {A_α} is a covering of Y by sets open in X. By hypothesis, some
finite subcollection {A_{α_1}, ..., A_{α_n}} covers Y. Then {A′_{α_1}, ..., A′_{α_n}} is a subcollection
of A′ that covers Y. ∎


Theorem 26.2. Every closed subspace of a compact space is compact. 


Proof. Let Y be a closed subspace of the compact space X. Given a covering A of Y
by sets open in X, let us form an open covering B of X by adjoining to A the single
open set X − Y, that is,


B = A ∪ {X − Y}.


Some finite subcollection of B covers X. If this subcollection contains the set X − Y,
discard X − Y; otherwise, leave the subcollection alone. The resulting collection is a
finite subcollection of A that covers Y. ∎


Theorem 26.3. Every compact subspace of a Hausdorff space is closed. 


Proof. Let Y be a compact subspace of the Hausdorff space X. We shall prove that
X − Y is open, so that Y is closed.

Let x_0 be a point of X − Y. We show there is a neighborhood of x_0 that is disjoint
from Y. For each point y of Y, let us choose disjoint neighborhoods U_y and V_y of the
points x_0 and y, respectively (using the Hausdorff condition). The collection {V_y | y ∈ Y}
is a covering of Y by sets open in X; therefore, finitely many of them V_{y_1}, ..., V_{y_n}
cover Y. The open set


V = V_{y_1} ∪ ··· ∪ V_{y_n}

contains Y, and it is disjoint from the open set

U = U_{y_1} ∩ ··· ∩ U_{y_n}


formed by taking the intersection of the corresponding neighborhoods of x_0. For if z
is a point of V, then z ∈ V_{y_i} for some i, hence z ∉ U_{y_i} and so z ∉ U. See Figure 26.1.
Then U is a neighborhood of x_0 disjoint from Y, as desired. ∎

Figure 26.1


The statement we proved in the course of the preceding proof will be useful to us 
later, so we repeat it here for reference purposes: 


Lemma 26.4. If Y is a compact subspace of the Hausdorff space X and x_0 is not in Y,
then there exist disjoint open sets U and V of X containing x_0 and Y, respectively.


EXAMPLE 5. Once we prove that the interval [a, b] in R is compact, it follows from
Theorem 26.2 that any closed subspace of [a, b] is compact. On the other hand, it follows
from Theorem 26.3 that the intervals (a, b] and (a, b) in R cannot be compact (which we
knew already) because they are not closed in the Hausdorff space R.


EXAMPLE 6. One needs the Hausdorff condition in the hypothesis of Theorem 26.3.
Consider, for example, the finite complement topology on the real line. The only proper
subsets of R that are closed in this topology are the finite sets. But every subset of R is
compact in this topology, as you can check.


Theorem 26.5. The image of a compact space under a continuous map is compact. 


Proof. Let f : X → Y be continuous; let X be compact. Let 𝒜 be a covering of the
set f(X) by sets open in Y. The collection


{f⁻¹(A) | A ∈ 𝒜}


is a collection of sets covering X; these sets are open in X because f is continuous.
Hence finitely many of them, say


f⁻¹(A_1), ..., f⁻¹(A_n),


cover X. Then the sets A_1, ..., A_n cover f(X). ∎

One important use of the preceding theorem is as a tool for verifying that a map is
a homeomorphism: 


Theorem 26.6. Let f : X → Y be a bijective continuous function. If X is compact
and Y is Hausdorff, then f is a homeomorphism.


Proof. We shall prove that images of closed sets of X under f are closed in Y; this
will prove continuity of the map f⁻¹. If A is closed in X, then A is compact, by
Theorem 26.2. Therefore, by the theorem just proved, f(A) is compact. Since Y is
Hausdorff, f(A) is closed in Y, by Theorem 26.3. ∎


Theorem 26.7. The product of finitely many compact spaces is compact. 


Proof. We shall prove that the product of two compact spaces is compact; the theo- 
rem follows by induction for any finite product. 


Step 1. Suppose that we are given spaces X and Y, with Y compact. Suppose that
x_0 is a point of X, and N is an open set of X × Y containing the “slice” x_0 × Y of
X × Y. We prove the following:
There is a neighborhood W of x_0 in X such that N contains the entire set
W × Y.


The set W × Y is often called a tube about x_0 × Y.

First let us cover x_0 × Y by basis elements U × V (for the topology of X × Y)
lying in N. The space x_0 × Y is compact, being homeomorphic to Y. Therefore, we
can cover x_0 × Y by finitely many such basis elements


U_1 × V_1, ..., U_n × V_n.


(We assume that each of the basis elements U_i × V_i actually intersects x_0 × Y, since
otherwise that basis element would be superfluous; we could discard it from the finite
collection and still have a covering of x_0 × Y.) Define


W = U_1 ∩ ··· ∩ U_n.


The set W is open, and it contains x_0 because each set U_i × V_i intersects x_0 × Y.

We assert that the sets U_i × V_i, which were chosen to cover the slice x_0 × Y,
actually cover the tube W × Y. Let x × y be a point of W × Y. Consider the point
x_0 × y of the slice x_0 × Y having the same y-coordinate as this point. Now x_0 × y
belongs to U_i × V_i for some i, so that y ∈ V_i. But x ∈ U_j for every j (because x ∈ W).
Therefore, we have x × y ∈ U_i × V_i, as desired.

Since all the sets U_i × V_i lie in N, and since they cover W × Y, the tube W × Y
lies in N also. See Figure 26.2.


Step 2. Now we prove the theorem. Let X and Y be compact spaces. Let A
be an open covering of X × Y. Given x_0 ∈ X, the slice x_0 × Y is compact and
may therefore be covered by finitely many elements A_1, ..., A_m of A. Their union
N = A_1 ∪ ··· ∪ A_m is an open set containing x_0 × Y; by Step 1, the open set N contains
a tube W × Y about x_0 × Y, where W is open in X. Then W × Y is covered by finitely
many elements A_1, ..., A_m of A.

Thus, for each x in X, we can choose a neighborhood W_x of x such that the tube
W_x × Y can be covered by finitely many elements of A. The collection of all the
neighborhoods W_x is an open covering of X; therefore by compactness of X, there
exists a finite subcollection


{W_1, ..., W_k}

covering X. The union of the tubes

W_1 × Y, ..., W_k × Y


is all of X × Y; since each may be covered by finitely many elements of A, so may
X × Y be covered. ∎


The statement proved in Step 1 of the preceding proof will be useful to us later, so
we repeat it here as a lemma, for reference purposes: 


Lemma 26.8 (The tube lemma). Consider the product space X × Y, where Y is
compact. If N is an open set of X × Y containing the slice x_0 × Y of X × Y, then N
contains some tube W × Y about x_0 × Y, where W is a neighborhood of x_0 in X.


EXAMPLE 7. The tube lemma is certainly not true if Y is not compact. For example, let
Y be the y-axis in R², and let


N = {x × y ; |x| < 1/(y² + 1)}.


Then N is an open set containing the set 0 × R, but it contains no tube about 0 × R. It is
illustrated in Figure 26.3.

Figure 26.3


There is an obvious question to ask at this point. Is the product of infinitely many
compact spaces compact? One would hope that the answer is “yes,” and in fact it is.
The result is important (and difficult) enough to be called by the name of the man who
proved it; it is called the Tychonoff theorem.

In proving the fact that a cartesian product of connected spaces is connected, one 
proves it first for finite products and derives the general case from that. In proving 
that cartesian products of compact spaces are compact, however, there is no way to 
go directly from finite products to infinite ones. The infinite case demands a new 
approach, and the proof is a difficult one. Because of its difficulty, and also to avoid 
losing the main thread of our discussion in this chapter, we have decided to postpone it 
until later. However, you can study it now if you wish; the section in which it is proved 
(§37) can be studied immediately after this section without causing any disruption in 
continuity. 

There is one final criterion for a space to be compact, a criterion that is formulated 
in terms of closed sets rather than open sets. It does not look very natural nor very
useful at first glance, but it in fact proves to be useful on a number of occasions. First 
we make a definition. 


Definition. A collection 𝒞 of subsets of X is said to have the finite intersection
property if for every finite subcollection

{C_1, ..., C_n}


of 𝒞, the intersection C_1 ∩ ··· ∩ C_n is nonempty.


Theorem 26.9. Let X be a topological space. Then X is compact if and only if
for every collection 𝒞 of closed sets in X having the finite intersection property, the
intersection ⋂_{C∈𝒞} C of all the elements of 𝒞 is nonempty.


Proof. Given a collection 𝒜 of subsets of X, let

𝒞 = {X − A | A ∈ 𝒜}


be the collection of their complements. Then the following statements hold:
(1) 𝒜 is a collection of open sets if and only if 𝒞 is a collection of closed sets.
(2) The collection 𝒜 covers X if and only if the intersection ⋂_{C∈𝒞} C of all the
elements of 𝒞 is empty.
(3) The finite subcollection {A_1, ..., A_n} of 𝒜 covers X if and only if the
intersection of the corresponding elements C_i = X − A_i of 𝒞 is empty.
The first statement is trivial, while the second and third follow from DeMorgan’s law:

X − (⋃_{α∈J} A_α) = ⋂_{α∈J} (X − A_α).

The proof of the theorem now proceeds in two easy steps: taking the contrapositive
(of the theorem), and then the complement (of the sets)!
The statement that X is compact is equivalent to saying: “Given any collection 𝒜
of open subsets of X, if 𝒜 covers X, then some finite subcollection of 𝒜 covers X.”
This statement is equivalent to its contrapositive, which is the following: “Given any
collection 𝒜 of open sets, if no finite subcollection of 𝒜 covers X, then 𝒜 does not
cover X.” Letting 𝒞 be, as earlier, the collection {X − A | A ∈ 𝒜} and applying
(1)-(3), we see that this statement is in turn equivalent to the following: “Given any
collection 𝒞 of closed sets, if every finite intersection of elements of 𝒞 is nonempty,
then the intersection of all the elements of 𝒞 is nonempty.” This is just the condition
of our theorem. ∎
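
The passage between coverings and intersections of complements is a purely set-theoretic identity, so it can be checked on a small finite example. The sketch below is an illustration only: the set X and the collection A are hypothetical, and openness plays no role in the identity itself.

```python
from functools import reduce

X = frozenset(range(6))
# A hypothetical collection of subsets of X, and the collection of complements.
A = [frozenset({0, 1, 2}), frozenset({2, 3}), frozenset({4, 5})]
C = [X - a for a in A]


def covers(sets) -> bool:
    """True if the union of the given sets is all of X."""
    return frozenset().union(*sets) == X


def common_part(sets) -> frozenset:
    """Intersection of the given subsets of X (all of X for an empty list)."""
    return reduce(frozenset.intersection, sets, X)


# DeMorgan's law: X minus the union equals the intersection of the complements.
lhs = X - frozenset().union(*A)
rhs = common_part(C)

# Statement (3) for the finite subcollection consisting of the first two sets.
sub = A[:2]
sub_complements = [X - a for a in sub]
```

Here A covers X and the complements have empty intersection, while the first two sets fail to cover X and their complements meet in {4, 5}.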


A special case of this theorem occurs when we have a nested sequence C_1 ⊃ C_2 ⊃
··· ⊃ C_n ⊃ C_{n+1} ⊃ ··· of closed sets in a compact space X. If each of the sets C_n is
nonempty, then the collection 𝒞 = {C_n}_{n∈Z₊} automatically has the finite intersection
property. Then the intersection

⋂_{n∈Z₊} C_n

is nonempty.
We shall use the closed set criterion for compactness in the next section to prove 
the uncountability of the set of real numbers, in Chapter 5 when we prove the Ty- 
chonoff theorem, and again in Chapter 8 when we prove the Baire category theorem. 


Exercises 


1. (a) Let T and T′ be two topologies on the set X; suppose that T′ ⊃ T. What
does compactness of X under one of these topologies imply about compactness
under the other?

(b) Show that if X is compact Hausdorff under both T and T′, then either T
and T′ are equal or they are not comparable.

2. (a) Show that in the finite complement topology on R, every subspace is
compact.
(b) If R has the topology consisting of all sets A such that R − A is either
countable or all of R, is [0, 1] a compact subspace?


3. Show that a finite union of compact subspaces of X is compact.
4. Show that every compact subspace of a metric space is bounded in that metric
and is closed. Find a metric space in which not every closed bounded subspace
is compact.

5. Let A and B be disjoint compact subspaces of the Hausdorff space X. Show that
there exist disjoint open sets U and V containing A and B, respectively.

6. Show that if f : X → Y is continuous, where X is compact and Y is Hausdorff,
then f is a closed map (that is, f carries closed sets to closed sets).

7. Show that if Y is compact, then the projection π_1 : X × Y → X is a closed map.
8. Theorem. Let f : X → Y; let Y be compact Hausdorff. Then f is continuous
if and only if the graph of f,

G_f = {x × f(x) | x ∈ X},

is closed in X × Y. [Hint: If G_f is closed and V is a neighborhood of f(x_0),
then the intersection of G_f and X × (Y − V) is closed. Apply Exercise 7.]

9. Generalize the tube lemma as follows:
Theorem. Let A and B be subspaces of X and Y, respectively; let N be an open
set in X × Y containing A × B. If A and B are compact, then there exist open
sets U and V in X and Y, respectively, such that


A × B ⊂ U × V ⊂ N.


10. (a) Prove the following partial converse to the uniform limit theorem:
Theorem. Let f_n : X → R be a sequence of continuous functions, with
f_n(x) → f(x) for each x ∈ X. If f is continuous, and if the sequence f_n is
monotone increasing, and if X is compact, then the convergence is uniform.
[We say that f_n is monotone increasing if f_n(x) ≤ f_{n+1}(x) for all n and x.]

(b) Give examples to show that this theorem fails if you delete the requirement
that X be compact, or if you delete the requirement that the sequence be
monotone. [Hint: See the exercises of §21.]

11. Theorem. Let X be a compact Hausdorff space. Let 𝒜 be a collection of closed
connected subsets of X that is simply ordered by proper inclusion. Then


Y = ⋂_{A∈𝒜} A

is connected. [Hint: If C ∪ D is a separation of Y, choose disjoint open sets U
and V of X containing C and D, respectively, and show that


⋂_{A∈𝒜} (A − (U ∪ V))

is not empty.]

12. Let p : X → Y be a closed continuous surjective map such that p⁻¹({y}) is
compact, for each y ∈ Y. (Such a map is called a perfect map.) Show that if Y
is compact, then X is compact. [Hint: If U is an open set containing p⁻¹({y}),
there is a neighborhood W of y such that p⁻¹(W) is contained in U.]


13. Let G be a topological group. 

(a) Let A and B be subspaces of G. If A is closed and B is compact, show A · B
is closed. [Hint: If c is not in A · B, find a neighborhood W of c such that
W · B⁻¹ is disjoint from A.]

(b) Let H be a subgroup of G; let p : G → G/H be the quotient map. If H is
compact, show that p is a closed map.

(c) Let H be a compact subgroup of G. Show that if G/H is compact, then G 
is compact. 


§27 Compact Subspaces of the Real Line 


The theorems of the preceding section enable us to construct new compact spaces from 
existing ones, but in order to get very far we have to find some compact spaces to start 
with. The natural place to begin is the real line; we shall prove that every closed inter- 
val in R is compact. Applications include the extreme value theorem and the uniform 
continuity theorem of calculus, suitably generalized. We also give a characterization 
of all compact subspaces of Rⁿ, and a proof of the uncountability of the set of real
numbers. 

It turns out that in order to prove every closed interval in R is compact, we need 
only one of the order properties of the real line—the least upper bound property. We 
shall prove the theorem using only this hypothesis; then it will apply not only to the 
real line, but to well-ordered sets and other ordered sets as well. 


Theorem 27.1. Let X be a simply ordered set having the least upper bound property. 
In the order topology, each closed interval in X is compact. 


Proof. Step 1. Given a < b, let A be a covering of [a, b] by sets open in [a, b] in the
subspace topology (which is the same as the order topology). We wish to prove the
existence of a finite subcollection of A covering [a, b]. First we prove the following:
If x is a point of [a, b] different from b, then there is a point y > x of [a, b] such that
the interval [x, y] can be covered by at most two elements of A.

If x has an immediate successor in X, let y be this immediate successor. Then
[x, y] consists of the two points x and y, so that it can be covered by at most two
elements of A. If x has no immediate successor in X, choose an element A of A
containing x. Because x ≠ b and A is open, A contains an interval of the form [x, c),
for some c in [a, b]. Choose a point y in (x, c); then the interval [x, y] is covered by
the single element A of A.

Step 2. Let C be the set of all points y > a of [a, b] such that the interval [a, y]
can be covered by finitely many elements of A. Applying Step 1 to the case x = a,
we see that there exists at least one such y, so C is not empty. Let c be the least upper
bound of the set C; then a < c ≤ b.


Step 3. We show that c belongs to C; that is, we show that the interval [a, c] can
be covered by finitely many elements of A. Choose an element A of A containing c;
since A is open, it contains an interval of the form (d, c] for some d in [a, b]. If c is
not in C, there must be a point z of C lying in the interval (d, c), because otherwise d
would be a smaller upper bound on C than c. See Figure 27.1. Since z is in C, the
interval [a, z] can be covered by finitely many, say n, elements of A. Now [z, c] lies
in the single element A of A, hence [a, c] = [a, z] ∪ [z, c] can be covered by n + 1
elements of A. Thus c is in C, contrary to assumption.


Figure 27.1

Figure 27.2


Step 4. Finally, we show that c = b, and our theorem is proved. Suppose that 
c < b. Applying Step 1 to the case x = c, we conclude that there exists a point y > c 
of [a, b] such that the interval [c, y] can be covered by finitely many elements of A. 
See Figure 27.2. We proved in Step 3 that c is in C, so [a, c] can be covered by finitely 
many elements of A. Therefore, the interval 


[a, y] = [a, c] ∪ [c, y]

can also be covered by finitely many elements of A. This means that y is in C,
contradicting the fact that c is an upper bound on C. ∎


Corollary 27.2. Every closed interval in R is compact. 

Now we characterize the compact subspaces of Rⁿ:

Theorem 27.3. A subspace A of Rⁿ is compact if and only if it is closed and is
bounded in the euclidean metric d or the square metric ρ.


Proof. It will suffice to consider only the metric ρ; the inequalities


ρ(x, y) ≤ d(x, y) ≤ √n ρ(x, y)


imply that A is bounded under d if and only if it is bounded under ρ.
Suppose that A is compact. Then, by Theorem 26.3, it is closed. Consider the
collection of open sets


{B_ρ(0, m) | m ∈ Z₊},

whose union is all of Rⁿ. Some finite subcollection covers A. It follows that A ⊂
B_ρ(0, M) for some M. Therefore, for any two points x and y of A, we have ρ(x, y) ≤
2M. Thus A is bounded under ρ.

Conversely, suppose that A is closed and bounded under ρ; suppose that ρ(x, y) ≤
N for every pair x, y of points of A. Choose a point x_0 of A, and let ρ(x_0, 0) = b.
The triangle inequality implies that ρ(x, 0) ≤ N + b for every x in A. If P = N + b,
then A is a subset of the cube [−P, P]ⁿ, which is compact. Being closed, A is also
compact. ∎
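
The comparison between the two metrics used at the start of this proof, ρ(x, y) ≤ d(x, y) ≤ √n ρ(x, y), can be spot-checked numerically. The sketch below is only a sanity check on random sample points, not a proof. The function names, the sampling range, and the small floating-point tolerance are our own assumptions.

```python
import math
import random


def d(x, y) -> float:
    """Euclidean metric on R^n."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def rho(x, y) -> float:
    """Square metric on R^n: the largest coordinate difference."""
    return max(abs(a - b) for a, b in zip(x, y))


def check(n: int, trials: int = 1000, seed: int = 0) -> bool:
    """Test rho <= d <= sqrt(n) * rho on random pairs of points of R^n."""
    rng = random.Random(seed)
    tol = 1e-12  # allowance for floating-point rounding
    for _ in range(trials):
        x = [rng.uniform(-5, 5) for _ in range(n)]
        y = [rng.uniform(-5, 5) for _ in range(n)]
        r, e = rho(x, y), d(x, y)
        if not (r <= e + tol and e <= math.sqrt(n) * r + tol):
            return False
    return True
```

For the points (0, 0) and (3, 4) of R², for instance, ρ is 4 and d is 5, which lies between 4 and 4√2.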


Students often remember this theorem as stating that the collection of compact 
sets in a metric space equals the collection of closed and bounded sets. This statement 
is clearly ridiculous as it stands, because the question as to which sets are bounded 
depends for its answer on the metric, whereas which sets are compact depends only on 
the topology of the space. 


EXAMPLE 1. The unit sphere Sⁿ⁻¹ and the closed unit ball Bⁿ in Rⁿ are compact
because they are closed and bounded. The set


A = {x × (1/x) | 0 < x ≤ 1}

is closed in R², but it is not compact because it is not bounded. The set

S = {x × (sin(1/x)) | 0 < x ≤ 1}

is bounded in R², but it is not compact because it is not closed.


Now we prove the extreme value theorem of calculus, in suitably generalized form. 


Theorem 27.4 (Extreme value theorem). Let f : X → Y be continuous, where Y
is an ordered set in the order topology. If X is compact, then there exist points c and d
in X such that f(c) ≤ f(x) ≤ f(d) for every x ∈ X.


The extreme value theorem of calculus is the special case of this theorem that 
occurs when we take X to be a closed interval in R and Y to be R. 


Proof. Since f is continuous and X is compact, the set A = f(X) is compact. We
show that A has a largest element M and a smallest element m. Then since m and M
belong to A, we must have m = f(c) and M = f(d) for some points c and d of X.
If A has no largest element, then the collection


{(−∞, a) | a ∈ A}

forms an open covering of A. Since A is compact, some finite subcollection

{(−∞, a_1), ..., (−∞, a_n)}


covers A. If a_i is the largest of the elements a_1, ..., a_n, then a_i belongs to none of these
sets, contrary to the fact that they cover A.
A similar argument shows that A has a smallest element. ∎

Now we prove the uniform continuity theorem of calculus. In the process, we
are led to introduce a new notion that will prove to be surprisingly useful, that of a 
Lebesgue number for an open covering of a metric space. First, a preliminary notion: 


Definition. Let (X, d) be a metric space; let A be a nonempty subset of X. For each
x ∈ X, we define the distance from x to A by the equation


d(x, A) = inf{d(x, a) | a ∈ A}.


It is easy to show that for fixed A, the function d(x, A) is a continuous function
of x: Given x, y ∈ X, one has the inequalities


d(x, A) ≤ d(x, a) ≤ d(x, y) + d(y, a),

for each a ∈ A. It follows that

d(x, A) − d(x, y) ≤ inf d(y, a) = d(y, A),

so that

d(x, A) − d(y, A) ≤ d(x, y).


The same inequality holds with x and y interchanged; continuity of the function 
d(x, A) follows. 
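
As a small illustration of the inequality just derived, the sketch below checks |d(x, A) − d(y, A)| ≤ d(x, y) for a finite subset A of the real line, where the infimum is simply a minimum. The set A, the sample points, and the helper name `dist_to_set` are hypothetical choices made for the example.

```python
from itertools import product


def dist_to_set(x, A):
    """d(x, A) for a finite nonempty subset A of R, with d(x, a) = |x - a|."""
    return min(abs(x - a) for a in A)


A = [-3, 0, 7]             # a hypothetical finite subset of R
samples = range(-10, 11)   # integer sample points

# The inequality |d(x, A) - d(y, A)| <= d(x, y), tested on all sample pairs.
lipschitz_ok = all(
    abs(dist_to_set(x, A) - dist_to_set(y, A)) <= abs(x - y)
    for x, y in product(samples, repeat=2)
)
```

Integer data keeps the arithmetic exact, so the comparison involves no rounding.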

Now we introduce the notion of Lebesgue number. Recall that the diameter of a 
bounded subset A of a metric space (X, d) is the number 


sup{d(a_1, a_2) | a_1, a_2 ∈ A}.


Lemma 27.5 (The Lebesgue number lemma). Let 𝒜 be an open covering of the
metric space (X, d). If X is compact, there is a δ > 0 such that for each subset of X
having diameter less than δ, there exists an element of 𝒜 containing it.


The number δ is called a Lebesgue number for the covering 𝒜.

Proof. Let 𝒜 be an open covering of X. If X itself is an element of 𝒜, then any
positive number is a Lebesgue number for 𝒜. So assume X is not an element of 𝒜.

Choose a finite subcollection {A_1, ..., A_n} of 𝒜 that covers X. For each i, set
C_i = X - A_i, and define f : X → R by letting f(x) be the average of the numbers
d(x, C_i). That is,

f(x) = (1/n) Σ_{i=1}^{n} d(x, C_i).

We show that f(x) > 0 for all x. Given x ∈ X, choose i so that x ∈ A_i. Then choose ε
so the ε-neighborhood of x lies in A_i. Then d(x, C_i) ≥ ε, so that f(x) ≥ ε/n.

Since f is continuous, it has a minimum value δ; we show that δ is our required
Lebesgue number. Let B be a subset of X of diameter less than δ. Choose a point x_0
of B; then B lies in the δ-neighborhood of x_0. Now

δ ≤ f(x_0) ≤ d(x_0, C_m),

where d(x_0, C_m) is the largest of the numbers d(x_0, C_i). Then the δ-neighborhood
of x_0 is contained in the element A_m = X - C_m of the covering 𝒜. ∎
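
The construction in this proof can be carried out by hand for a small covering. The sketch below is an illustration only, under stated assumptions: X = [0, 1] with the usual metric, and a hypothetical two-element covering by A_1 = [0, 3/5) and A_2 = (2/5, 1], both open in X. It evaluates the averaging function f on a grid and takes the smallest sampled value as δ. The function names are made up for this example.

```python
from fractions import Fraction


def dist_to_closed_interval(x, lo, hi):
    """Distance from x to the closed interval [lo, hi] in the usual metric on R."""
    return max(lo - x, x - hi, Fraction(0))


def average_distance(x, complements):
    """The function f(x) of the proof: the average of the numbers d(x, C_i)."""
    total = sum(dist_to_closed_interval(x, lo, hi) for lo, hi in complements)
    return total / len(complements)


# Hypothetical covering of X = [0, 1] by A_1 = [0, 3/5) and A_2 = (2/5, 1].
# Their complements in X are C_1 = [3/5, 1] and C_2 = [0, 2/5].
complements = [
    (Fraction(3, 5), Fraction(1)),
    (Fraction(0), Fraction(2, 5)),
]

# Sample f on a grid; the minimum over the grid stands in for the true minimum.
grid = [Fraction(k, 100) for k in range(101)]
delta = min(average_distance(x, complements) for x in grid)
```

For this covering f is piecewise linear with its breakpoints on the grid, so the sampled minimum, 1/10, is the true minimum. The proof does not claim that this δ is the largest Lebesgue number; here any number up to 1/5 would also serve.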


Definition. A function f from the metric space (X, d_X) to the metric space (Y, d_Y)
is said to be uniformly continuous if given ε > 0, there is a δ > 0 such that for every
pair of points x_0, x_1 of X,


d_X(x_0, x_1) < δ ⟹ d_Y(f(x_0), f(x_1)) < ε.


Theorem 27.6 (Uniform continuity theorem). Let f : X → Y be a continuous
map of the compact metric space (X, d_X) to the metric space (Y, d_Y). Then f is
uniformly continuous.


Proof. Given ε > 0, take the open covering of Y by balls B(y, ε/2) of radius ε/2.
Let 𝒜 be the open covering of X by the inverse images of these balls under f. Choose δ
to be a Lebesgue number for the covering 𝒜. Then if x_1 and x_2 are two points of X
such that d_X(x_1, x_2) < δ, the two-point set {x_1, x_2} has diameter less than δ, so that
its image {f(x_1), f(x_2)} lies in some ball B(y, ε/2). Then d_Y(f(x_1), f(x_2)) < ε, as
desired. ∎


Finally, we prove that the real numbers are uncountable. The interesting thing 
about this proof is that it involves no algebra at all—no decimal or binary expansions 
of real numbers or the like—just the order properties of R. 


Definition. If X is a space, a point x of X is said to be an isolated point of X if the 
one-point set {x} is open in X.


Theorem 27.7. Let X be a nonempty compact Hausdorff space. If X has no isolated 
points, then X is uncountable. 


Proof. Step 1. We show first that given any nonempty open set U of X and any
point x of X, there exists a nonempty open set V contained in U such that x ∉ V̄.

Choose a point y of U different from x; this is possible if x is in U because x is not
an isolated point of X and it is possible if x is not in U simply because U is nonempty.
Now choose disjoint open sets W_1 and W_2 about x and y, respectively. Then the set
V = W_2 ∩ U is the desired open set; it is contained in U, it is nonempty because it
contains y, and its closure does not contain x. See Figure 27.3.


Step 2. We show that given f : Z_+ → X, the function f is not surjective. It
follows that X is uncountable.

Figure 27.3


Let x_n = f(n). Apply Step 1 to the nonempty open set U = X to choose a
nonempty open set V_1 ⊂ X such that V̄_1 does not contain x_1. In general, given V_{n-1}
open and nonempty, choose V_n to be a nonempty open set such that V_n ⊂ V_{n-1} and V̄_n
does not contain x_n. Consider the nested sequence


V̄_1 ⊃ V̄_2 ⊃ ···


of nonempty closed sets of X. Because X is compact, there is a point x ∈ ⋂ V̄_n, by
Theorem 26.9. Now x cannot equal x_n for any n, since x belongs to V̄_n and x_n does
not. ∎


Corollary 27.8. Every closed interval in R is uncountable. 


Exercises 


1. Prove that if X is an ordered set in which every closed interval is compact, then X 
has the least upper bound property. 


2. Let X be a metric space with metric d; let A ⊂ X be nonempty.
(a) Show that d(x, A) = 0 if and only if x ∈ Ā.
(b) Show that if A is compact, d(x, A) = d(x, a) for some a ∈ A.
(c) Define the ε-neighborhood of A in X to be the set


U(A, ε) = {x | d(x, A) < ε}.


Show that U(A, ε) equals the union of the open balls B_d(a, ε) for a ∈ A.
(d) Assume that A is compact; let U be an open set containing A. Show that
some ε-neighborhood of A is contained in U.
(e) Show the result in (d) need not hold if A is closed but not compact.


3. Recall that R_K denotes R in the K-topology.
(a) Show that [0, 1] is not compact as a subspace of R_K.
(b) Show that R_K is connected. [Hint: (-∞, 0) and (0, ∞) inherit their usual
topologies as subspaces of R_K.]
(c) Show that R_K is not path connected.


4. Show that a connected metric space having more than one point is uncountable.


5. Let X be a compact Hausdorff space; let {A_n} be a countable collection of closed
sets of X. Show that if each set A_n has empty interior in X, then the union ⋃ A_n
has empty interior in X. [Hint: Imitate the proof of Theorem 27.7.]
This is a special case of the Baire category theorem, which we shall study in
Chapter 8.


6. Let A_0 be the closed interval [0, 1] in R. Let A_1 be the set obtained from A_0 by
deleting its “middle third” (1/3, 2/3). Let A_2 be the set obtained from A_1 by deleting
its “middle thirds” (1/9, 2/9) and (7/9, 8/9). In general, define A_n by the equation


A_n = A_{n-1} - ⋃_{k=0}^{∞} ((1 + 3k)/3^n, (2 + 3k)/3^n).


The intersection

C = ⋂_{n ∈ Z_+} A_n

is called the Cantor set; it is a subspace of [0, 1].

(a) Show that C is totally disconnected.

(b) Show that C is compact.

(c) Show that each set A_n is a union of finitely many disjoint closed intervals of
length 1/3^n; and show that the end points of these intervals lie in C.

(d) Show that C has no isolated points.

(e) Conclude that C is uncountable.
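
The first few sets A_n are easy to generate exactly. The sketch below is an illustration under an assumption: it uses the description "delete the open middle third of each remaining closed interval," which agrees with the defining formula only once part (c) is established. The function names are invented for this example.

```python
from fractions import Fraction


def remove_middle_thirds(intervals):
    """Replace each closed interval [a, b] by its two outer closed thirds."""
    result = []
    for a, b in intervals:
        third = (b - a) / 3
        result.append((a, a + third))
        result.append((b - third, b))
    return result


def cantor_stage(n):
    """Closed intervals making up A_n, as (left, right) pairs of Fractions."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(n):
        intervals = remove_middle_thirds(intervals)
    return intervals


stage_two = cantor_stage(2)
```

Here stage_two consists of the four intervals [0, 1/9], [2/9, 1/3], [2/3, 7/9], and [8/9, 1], matching the description of A_2 in the exercise.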


§28 Limit Point Compactness 


As indicated when we first mentioned compact sets, there are other formulations of 
the notion of compactness that are frequently useful. In this section we introduce 
one of them. Weaker in general than compactness, it coincides with compactness for 
metrizable spaces. 


Definition. A space X is said to be limit point compact if every infinite subset of X 
has a limit point. 

In some ways this property is more natural and intuitive than that of compactness.
In the early days of topology, it was given the name “compactness,” while the open
covering formulation was called “bicompactness.” Later, the word “compact” was
shifted to apply to the open covering definition, leaving this one to search for a new
name. It still has not found a name on which everyone agrees. On historical grounds,
some call it “Fréchet compactness”; others call it the “Bolzano-Weierstrass property.”
We have invented the term “limit point compactness.” It seems as good a term as any;
at least it describes what the property is about.


Theorem 28.1. Compactness implies limit point compactness, but not conversely.


Proof. Let X be a compact space. Given a subset A of X, we wish to prove that if A
is infinite, then A has a limit point. We prove the contrapositive: if A has no limit
point, then A must be finite.

So suppose A has no limit point. Then A contains all its limit points, so that A is
closed. Furthermore, for each a ∈ A we can choose a neighborhood U_a of a such that
U_a intersects A in the point a alone. The space X is covered by the open set X - A
and the open sets U_a; being compact, it can be covered by finitely many of these sets.
Since X - A does not intersect A, and each set U_a contains only one point of A, the
set A must be finite. ∎


EXAMPLE 1. Let Y consist of two points; give Y the topology consisting of Y and
the empty set. Then the space X = Z_+ × Y is limit point compact, for every nonempty
subset of X has a limit point. It is not compact, for the covering of X by the open sets
U_n = {n} × Y has no finite subcollection covering X.


EXAMPLE 2. Here is a less trivial example. Consider the minimal uncountable
well-ordered set S_Ω, in the order topology. The space S_Ω is not compact, since it has no largest
element. However, it is limit point compact: Let A be an infinite subset of S_Ω. Choose a
subset B of A that is countably infinite. Being countable, the set B has an upper bound b
in S_Ω; then B is a subset of the interval [a_0, b] of S_Ω, where a_0 is the smallest element
of S_Ω. Since S_Ω has the least upper bound property, the interval [a_0, b] is compact. By the
preceding theorem, B has a limit point x in [a_0, b]. The point x is also a limit point of A.
Thus S_Ω is limit point compact.

We now show these two versions of compactness coincide for metrizable spaces;
for this purpose, we introduce yet another version of compactness called sequential 
compactness. This result will be used in Chapter 7. 


Definition. Let X be a topological space. If (x_n) is a sequence of points of X, and if

n_1 < n_2 < ··· < n_i < ···

is an increasing sequence of positive integers, then the sequence (y_i) defined by setting
y_i = x_{n_i} is called a subsequence of the sequence (x_n). The space X is said to be
sequentially compact if every sequence of points of X has a convergent subsequence.


*Theorem 28.2. Let X be a metrizable space. Then the following are equivalent:
(1) X is compact. 
(2) X is limit point compact. 
(3) X is sequentially compact. 

Proof. We have already proved that (1) ⟹ (2). To show that (2) ⟹ (3), assume
that X is limit point compact. Given a sequence (x_n) of points of X, consider the set
A = {x_n | n ∈ Z_+}. If the set A is finite, then there is a point x such that x = x_n for
infinitely many values of n. In this case, the sequence (x_n) has a subsequence that is
constant, and therefore converges trivially. On the other hand, if A is infinite, then A
has a limit point x. We define a subsequence of (x_n) converging to x as follows: First
choose n_1 so that


x_{n_1} ∈ B(x, 1).


Then suppose that the positive integer n_{i-1} is given. Because the ball B(x, 1/i)
intersects A in infinitely many points, we can choose an index n_i > n_{i-1} such that


x_{n_i} ∈ B(x, 1/i).


Then the subsequence x_{n_1}, x_{n_2}, ... converges to x.

Finally, we show that (3) ⟹ (1). This is the hardest part of the proof.

First, we show that if X is sequentially compact, then the Lebesgue number lemma
holds for X. (This would follow from compactness, but compactness is what we are
trying to prove!) Let 𝒜 be an open covering of X. We assume that there is no δ > 0
such that each set of diameter less than δ has an element of 𝒜 containing it, and derive
a contradiction.

Our assumption implies in particular that for each positive integer n, there exists a
set of diameter less than 1/n that is not contained in any element of 𝒜; let C_n be such a
set. Choose a point x_n ∈ C_n, for each n. By hypothesis, some subsequence (x_{n_i}) of the
sequence (x_n) converges, say to the point a. Now a belongs to some element A of the
collection 𝒜; because A is open, we may choose an ε > 0 such that B(a, ε) ⊂ A. If i
is large enough that 1/n_i < ε/2, then the set C_{n_i} lies in the ε/2-neighborhood of x_{n_i}; if
i is also chosen large enough that d(x_{n_i}, a) < ε/2, then C_{n_i} lies in the ε-neighborhood
of a. But this means that C_{n_i} ⊂ A, contrary to hypothesis.

Second, we show that if X is sequentially compact, then given ε > 0, there exists
a finite covering of X by open ε-balls. Once again, we proceed by contradiction.
Assume that there exists an ε > 0 such that X cannot be covered by finitely many
ε-balls. Construct a sequence of points x_n of X as follows: First, choose x_1 to be any
point of X. Noting that the ball B(x_1, ε) is not all of X (otherwise X could be covered
by a single ε-ball), choose x_2 to be a point of X not in B(x_1, ε). In general, given
x_1, ..., x_n, choose x_{n+1} to be a point not in the union


B(x_1, ε) ∪ ··· ∪ B(x_n, ε),


using the fact that these balls do not cover X. Note that by construction d(x_{n+1}, x_i) ≥
ε for i = 1, ..., n. Therefore, the sequence (x_n) can have no convergent subsequence;
in fact, any ball of radius ε/2 can contain x_n for at most one value of n.

Finally, we show that if X is sequentially compact, then X is compact. Let 𝒜 be
an open covering of X. Because X is sequentially compact, the open covering 𝒜 has
a Lebesgue number δ. Let ε = δ/3; use sequential compactness of X to find a finite
covering of X by open ε-balls. Each of these balls has diameter at most 2δ/3, so it
lies in an element of 𝒜. Choosing one such element of 𝒜 for each of these ε-balls, we
obtain a finite subcollection of 𝒜 that covers X. ∎
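
The point-picking step in the second part of this proof (choose x_{n+1} outside the ε-balls about x_1, ..., x_n) can be run directly on a finite sample, where it must stop. The sketch below is only an illustration under assumptions not in the text: a finite grid in [0, 1] stands in for the space X, the metric is the usual one, and the function names are invented.

```python
from fractions import Fraction


def greedy_separated_points(points, eps, dist):
    """Scan `points` in order, keeping each point that lies outside every
    open eps-ball centered at a point kept earlier."""
    chosen = []
    for p in points:
        if all(dist(p, q) >= eps for q in chosen):
            chosen.append(p)
    return chosen


def usual_metric(x, y):
    """The usual metric on the real line."""
    return abs(x - y)


# A finite sample of [0, 1] stands in for the space X (an assumption of this sketch).
sample = [Fraction(k, 10) for k in range(11)]
eps = Fraction(1, 4)
centers = greedy_separated_points(sample, eps, usual_metric)

# Every sample point lies in the open eps-ball about some chosen center.
covered = all(any(usual_metric(p, c) < eps for c in centers) for p in sample)
```

When the selection stops, the ε-balls about the chosen centers cover the sample. In the proof, sequential compactness is what forces the corresponding selection in X to stop after finitely many steps.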


EXAMPLE 3. Recall that S̄_Ω denotes the minimal uncountable well-ordered set S_Ω with
the point Ω adjoined. (In the order topology, Ω is a limit point of S_Ω, which is why we
introduced the notation S̄_Ω for S_Ω ∪ {Ω}, back in §10.) It is easy to see that the space S̄_Ω
is not metrizable, for it does not satisfy the sequence lemma: The point Ω is a limit point
of S_Ω, but it is not the limit of a sequence of points of S_Ω, for any sequence of points of S_Ω
has an upper bound in S_Ω. The space S_Ω, on the other hand, does satisfy the sequence
lemma, as you can readily check. Nevertheless, S_Ω is not metrizable, for it is limit point
compact but not compact.


Exercises


1. Give [0, 1]^ω the uniform topology. Find an infinite subset of this space that has
no limit point.


2. Show that [0, 1] is not limit point compact as a subspace of R_ℓ.


3. Let X be limit point compact.
(a) If f : X → Y is continuous, does it follow that f(X) is limit point compact?
(b) If A is a closed subset of X, does it follow that A is limit point compact?
(c) If X is a subspace of the Hausdorff space Z, does it follow that X is closed
in Z?
We comment that it is not in general true that the product of two limit point
compact spaces is limit point compact, even if the Hausdorff condition is assumed.
But the examples are fairly sophisticated. See [S-S], Example 112.


4. A space X is said to be countably compact if every countable open covering
of X contains a finite subcollection that covers X. Show that for a T_1 space X,
countable compactness is equivalent to limit point compactness. [Hint: If no
finite subcollection of U_n covers X, choose x_n ∉ U_1 ∪ ··· ∪ U_n, for each n.]


5. Show that X is countably compact if and only if every nested sequence C_1 ⊃
C_2 ⊃ ··· of closed nonempty sets of X has a nonempty intersection.


6. Let (X, d) be a metric space. If f : X → X satisfies the condition


d(f(x), f(y)) = d(x, y)


for all x, y ∈ X, then f is called an isometry of X. Show that if f is an isometry
and X is compact, then f is bijective and hence a homeomorphism. [Hint: If
a ∉ f(X), choose ε so that the ε-neighborhood of a is disjoint from f(X). Set
x_1 = a, and x_{n+1} = f(x_n) in general. Show that d(x_n, x_m) ≥ ε for n ≠ m.]


7. Let (X, d) be a metric space. If f satisfies the condition


d(f(x), f(y)) < d(x, y)


for all x, y ∈ X with x ≠ y, then f is called a shrinking map. If there is a
number α < 1 such that


d(f(x), f(y)) ≤ α d(x, y)


for all x, y ∈ X, then f is called a contraction. A fixed point of f is a point x
such that f(x) = x.
(a) If f is a contraction and X is compact, show f has a unique fixed point.
[Hint: Define f^1 = f and f^{n+1} = f ∘ f^n. Consider the intersection A of
the sets A_n = f^n(X).]
(b) Show more generally that if f is a shrinking map and X is compact, then f
has a unique fixed point. [Hint: Let A be as before. Given x ∈ A, choose x_n
so that x = f^{n+1}(x_n). If a is the limit of some subsequence of the sequence
y_n = f^n(x_n), show that a ∈ A and f(a) = x. Conclude that A = f(A), so
that diam A = 0.]
(c) Let X = [0, 1]. Show that f(x) = x - x^2/2 maps X into X and is a
shrinking map that is not a contraction. [Hint: Use the mean-value theorem
of calculus.]
(d) The result in (a) holds if X is a complete metric space, such as R; see the
exercises of §43. The result in (b) does not: Show that the map f : R → R
given by f(x) = [x + (x^2 + 1)^{1/2}]/2 is a shrinking map that is not a
contraction and has no fixed point.
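
A numerical experiment can make the difference between the two maps in this exercise visible before one attempts the proofs. The sketch below is an illustration only; the function names are invented, and floating-point iteration is evidence, not an argument. It iterates x - x^2/2 on [0, 1], whose iterates drift toward the fixed point 0, and evaluates [x + (x^2 + 1)^{1/2}]/2 at a few points, where the value always exceeds x.

```python
import math


def iterate(f, x0, steps):
    """Apply f to x0 repeatedly, `steps` times, and return the final value."""
    x = x0
    for _ in range(steps):
        x = f(x)
    return x


def shrink_on_unit_interval(x):
    """The map x - x^2/2 on [0, 1]."""
    return x - x * x / 2


def shrink_on_real_line(x):
    """The map [x + (x^2 + 1)^(1/2)]/2 on R."""
    return (x + math.sqrt(x * x + 1)) / 2


# Iterates of the first map approach its fixed point 0, but only slowly.
after_many_steps = iterate(shrink_on_unit_interval, 1.0, 10_000)

# The second map moves every sampled point strictly to the right.
always_moves_right = all(
    shrink_on_real_line(x) > x for x in [-10.0, -1.0, 0.0, 1.0, 10.0]
)
```

The slow convergence of the first map is consistent with its not being a contraction: near 0 the ratio d(f(x), f(y))/d(x, y) comes arbitrarily close to 1.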




§29 Local Compactness 


In this section we study the notion of local compactness, and we prove the basic the- 
orem that any locally compact Hausdorff space can be imbedded in a certain compact 
Hausdorff space that is called its one-point compactification. 


Definition. A space X is said to be locally compact at x if there is some compact 
subspace C of X that contains a neighborhood of x. If X is locally compact at each of 
its points, X is said simply to be locally compact. 


Note that a compact space is automatically locally compact. 


EXAMPLE 1. The real line R is locally compact. The point x lies in some interval (a, b),
which in turn is contained in the compact subspace [a, b]. The subspace Q of rational
numbers is not locally compact, as you can check.


EXAMPLE 2. The space R^n is locally compact; the point x lies in some basis element
(a_1, b_1) × ··· × (a_n, b_n), which in turn lies in the compact subspace [a_1, b_1] × ··· × [a_n, b_n].
The space R^ω is not locally compact; none of its basis elements are contained in compact
subspaces. For if


B = (a_1, b_1) × ··· × (a_n, b_n) × R × ··· × R × ···

were contained in a compact subspace, then its closure

B̄ = [a_1, b_1] × ··· × [a_n, b_n] × R × ···


would be compact, which it is not.


EXAMPLE 3. Every simply ordered set X having the least upper bound property is 
locally compact: Given a basis element for X, it is contained in a closed interval in X, 
which is compact. 


Two of the most well-behaved classes of spaces to deal with in mathematics are the 
metrizable spaces and the compact Hausdorff spaces. Such spaces have many useful 
properties, which one can use in proving theorems and making constructions and the 
like. If a given space is not of one of these types, the next best thing one can hope for is 
that it is a subspace of one of these spaces. Of course, a subspace of a metrizable space 
is itself metrizable, so one does not get any new spaces in this way. But a subspace of a 
compact Hausdorff space need not be compact. Thus arises the question: Under what 
conditions is a space homeomorphic with a subspace of a compact Hausdorff space? 
We give one answer here. We shall return to this question in Chapter 5 when we study 
compactifications in general. 


Theorem 29.1. Let X be a space. Then X is locally compact Hausdorff if and only 
if there exists a space Y satisfying the following conditions: 

(1) X is a subspace of Y. 

(2) The set Y — X consists of a single point. 


(3) Y is a compact Hausdorff space. 
If Y and Y' are two spaces satisfying these conditions, then there is a homeomorphism 
of Y with Y’ that equals the identity map on X. 


Proof. Step 1. We first verify uniqueness. Let Y and Y' be two spaces satisfying
these conditions. Define h : Y → Y' by letting h map the single point p of Y - X to
the point q of Y' - X, and letting h equal the identity on X. We show that if U is open
in Y, then h(U) is open in Y'. Symmetry then implies that h is a homeomorphism.

First, consider the case where U does not contain p. Then h(U) = U. Since U is
open in Y and is contained in X, it is open in X. Because X is open in Y', the set U is
also open in Y', as desired.

Second, suppose that U contains p. Since C = Y - U is closed in Y, it is compact
as a subspace of Y. Because C is contained in X, it is a compact subspace of X.
Then because X is a subspace of Y', the space C is also a compact subspace of Y'.
Because Y' is Hausdorff, C is closed in Y', so that h(U) = Y' - C is open in Y', as
desired.

Step 2. Now we suppose X is locally compact Hausdorff and construct the space Y.
Step 1 gives us an idea how to proceed. Let us take some object that is not a point
of X, denote it by the symbol ∞ for convenience, and adjoin it to X, forming the set
Y = X ∪ {∞}. Topologize Y by defining the collection of open sets of Y to consist
of (1) all sets U that are open in X, and (2) all sets of the form Y - C, where C is a
compact subspace of X.

We need to check that this collection is, in fact, a topology on Y. The empty set is
a set of type (1), and the space Y is a set of type (2). Checking that the intersection of
two open sets is open involves three cases:


U_1 ∩ U_2 is of type (1).
(Y - C_1) ∩ (Y - C_2) = Y - (C_1 ∪ C_2) is of type (2).
U_1 ∩ (Y - C_1) = U_1 ∩ (X - C_1) is of type (1),


because C_1 is closed in X. Similarly, one checks that the union of any collection of
open sets is open:


⋃ U_α = U is of type (1).
⋃ (Y - C_β) = Y - (⋂ C_β) = Y - C is of type (2).
(⋃ U_α) ∪ (⋃ (Y - C_β)) = U ∪ (Y - C) = Y - (C - U),


which is of type (2) because C - U is a closed subspace of C and therefore compact.

Now we show that X is a subspace of Y. Given any open set of Y, we show its
intersection with X is open in X. If U is of type (1), then U ∩ X = U; if Y - C is of
type (2), then (Y - C) ∩ X = X - C; both of these sets are open in X. Conversely,
any set open in X is a set of type (1) and therefore open in Y by definition.

To show that Y is compact, let 𝒜 be an open covering of Y. The collection 𝒜 must
contain an open set of type (2), say Y - C, since none of the open sets of type (1)
contain the point ∞. Take all the members of 𝒜 different from Y - C and intersect them
with X; they form a collection of open sets of X covering C. Because C is compact,
finitely many of them cover C; the corresponding finite collection of elements of 𝒜
will, along with the element Y - C, cover all of Y.

To show that Y is Hausdorff, let x and y be two points of Y. If both of them lie
in X, there are disjoint sets U and V open in X containing them, respectively. On the
other hand, if x ∈ X and y = ∞, we can choose a compact set C in X containing
a neighborhood U of x. Then U and Y - C are disjoint neighborhoods of x and ∞,
respectively, in Y.

Step 3. Finally, we prove the converse. Suppose a space Y satisfying conditions
(1)-(3) exists. Then X is Hausdorff because it is a subspace of the Hausdorff space Y.
Given x ∈ X, we show X is locally compact at x. Choose disjoint open sets U and V
of Y containing x and the single point of Y - X, respectively. Then the set C = Y - V
is closed in Y, so it is a compact subspace of Y. Since C lies in X, it is also compact
as a subspace of X; it contains the neighborhood U of x. ∎


If X itself should happen to be compact, then the space Y of the preceding theorem
is not very interesting, for it is obtained from X by adjoining a single isolated point.
However, if X is not compact, then the point of Y - X is a limit point of X, so that
X̄ = Y.


Definition. If Y is a compact Hausdorff space and X is a proper subspace of Y whose
closure equals Y, then Y is said to be a compactification of X. If Y - X equals a single
point, then Y is called the one-point compactification of X.


We have shown that X has a one-point compactification Y if and only if X is
a locally compact Hausdorff space that is not itself compact. We speak of Y as “the”
one-point compactification because Y is uniquely determined up to a homeomorphism.


EXAMPLE 4. The one-point compactification of the real line R is homeomorphic with
the circle, as you may readily check. Similarly, the one-point compactification of R^2 is
homeomorphic to the sphere S^2. If R^2 is looked at as the space C of complex numbers,
then C ∪ {∞} is called the Riemann sphere, or the extended complex plane.
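
One standard way to see the first claim of this example is stereographic projection, which the text does not spell out; the sketch below is offered under that assumption, with invented function names. It maps R onto the unit circle with the point (0, 1) removed, and that missing point plays the role of ∞.

```python
import math


def line_to_circle(t):
    """Inverse stereographic projection: send t in R to a point of the unit
    circle other than the north pole (0, 1)."""
    d = t * t + 1
    return (2 * t / d, (t * t - 1) / d)


def circle_to_line(x, y):
    """Stereographic projection from the north pole; undefined at (0, 1)."""
    return x / (1 - y)


# A point far out on the line lands near the north pole of the circle.
far_x, far_y = line_to_circle(1e6)
distance_to_pole = math.hypot(far_x, far_y - 1)
```

As |t| grows, line_to_circle(t) approaches (0, 1) from either side, which is the behavior one expects of the added point in the one-point compactification.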


In some ways our definition of local compactness is not very satisfying. Usually 
one says that a space X satisfies a given property “locally” if every x € X has “arbi- 
trarily small” neighborhoods having the given property. Our definition of local com- 
pactness has nothing to do with “arbitrarily small” neighborhoods, so there is some 
question whether we should call it local compactness at all. 

Here is another formulation of local compactness, one more truly “local” in nature; 
it is equivalent to our definition when X is Hausdorff. 


Theorem 29.2. Let X be a Hausdorff space. Then X is locally compact if and only
if given x in X, and given a neighborhood U of x, there is a neighborhood V of x such
that V̄ is compact and V̄ ⊂ U.


Proof. Clearly this new formulation implies local compactness; the set C = V̄ is the
desired compact set containing a neighborhood of x. To prove the converse, suppose X
is locally compact; let x be a point of X and let U be a neighborhood of x. Take the
one-point compactification Y of X, and let C be the set Y - U. Then C is closed
in Y, so that C is a compact subspace of Y. Apply Lemma 26.4 to choose disjoint
open sets V and W containing x and C, respectively. Then the closure V̄ of V in Y is
compact; furthermore, V̄ is disjoint from C, so that V̄ ⊂ U, as desired. ∎


Corollary 29.3. Let X be locally compact Hausdorff; let A be a subspace of X. If A
is closed in X or open in X, then A is locally compact.


Proof. Suppose that A is closed in X. Given x ∈ A, let C be a compact subspace
of X containing the neighborhood U of x in X. Then C ∩ A is closed in C and thus
compact, and it contains the neighborhood U ∩ A of x in A. (We have not used the
Hausdorff condition here.)

Suppose now that A is open in X. Given x ∈ A, we apply the preceding theorem
to choose a neighborhood V of x in X such that V̄ is compact and V̄ ⊂ A. Then
C = V̄ is a compact subspace of A containing the neighborhood V of x in A. ∎


Corollary 29.4. A space X is homeomorphic to an open subspace of a compact
Hausdorff space if and only if X is locally compact Hausdorff.


Proof. This follows from Theorem 29.1 and Corollary 29.3. ∎

Exercises


1. Show that the rationals Q are not locally compact.


2. Let {X_α} be an indexed family of nonempty spaces.
(a) Show that if ∏ X_α is locally compact, then each X_α is locally compact and
X_α is compact for all but finitely many values of α.
(b) Prove the converse, assuming the Tychonoff theorem.


3. Let X be a locally compact space. If f : X → Y is continuous, does it follow
that f(X) is locally compact? What if f is both continuous and open? Justify
your answer.


4. Show that [0, 1]^ω is not locally compact in the uniform topology.


5. If f : X_1 → X_2 is a homeomorphism of locally compact Hausdorff spaces,
show f extends to a homeomorphism of their one-point compactifications.


6. Show that the one-point compactification of R is homeomorphic with the
circle S^1.


7. Show that the one-point compactification of S_Ω is homeomorphic with S̄_Ω.


8. Show that the one-point compactification of Z_+ is homeomorphic with the
subspace {0} ∪ {1/n | n ∈ Z_+} of R.


9. Show that if G is a locally compact topological group and H is a subgroup, then
G/H is locally compact.


10. Show that if X is a Hausdorff space that is locally compact at the point x, then
for each neighborhood U of x, there is a neighborhood V of x such that V̄ is
compact and V̄ ⊂ U.


*11. Prove the following:


(a) Lemma. If p : X → Y is a quotient map and if Z is a locally compact
Hausdorff space, then the map


π = p × i_Z : X × Z → Y × Z


is a quotient map.
[Hint: If π^{-1}(A) is open and contains x × y, choose open sets U_1 and V
with V̄ compact, such that x × y ∈ U_1 × V and U_1 × V̄ ⊂ π^{-1}(A). Given
U_i × V̄ ⊂ π^{-1}(A), use the tube lemma to choose an open set U_{i+1} containing
p^{-1}(p(U_i)) such that U_{i+1} × V̄ ⊂ π^{-1}(A). Let U = ⋃ U_i; show that U × V
is a saturated neighborhood of x × y that is contained in π^{-1}(A).]
An entirely different proof of this result will be outlined in the exercises
of §46.

(b) Theorem. Let p : A → B and q : C → D be quotient maps. If B and C
are locally compact Hausdorff spaces, then p × q : A × C → B × D is a
quotient map.

*Supplementary Exercises: Nets


We have already seen that sequences are “adequate” to detect limit points, continuous
functions, and compact sets in metrizable spaces. There is a generalization of the
notion of sequence, called a net, that will do the same thing for an arbitrary topological
space. We give the relevant definitions here, and leave the proofs as exercises. Recall
that a relation ≼ on a set A is called a partial order relation if the following conditions
hold:

(1) α ≼ α for all α.

(2) If α ≼ β and β ≼ α, then α = β.

(3) If α ≼ β and β ≼ γ, then α ≼ γ.

Now we make the following definition:

A directed set J is a set with a partial order ≼ such that for each pair α, β of
elements of J, there exists an element γ of J having the property that α ≼ γ and
β ≼ γ.
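
For a finite set the defining condition of a directed set can be tested by brute force. The sketch below is a small illustration with invented names; it checks only the upper-bound condition and takes for granted that the relation passed in is already a partial order.

```python
from itertools import combinations


def is_directed(elements, precedes):
    """Return True if every pair a, b in `elements` has some c in `elements`
    with precedes(a, c) and precedes(b, c)."""
    return all(
        any(precedes(a, c) and precedes(b, c) for c in elements)
        for a in elements
        for b in elements
    )


def all_subsets(s):
    """All subsets of the finite set s, as frozensets."""
    items = list(s)
    return [
        frozenset(combo)
        for size in range(len(items) + 1)
        for combo in combinations(items, size)
    ]


def inclusion(a, b):
    """a precedes b when a is a subset of b."""
    return a <= b


subsets_directed = is_directed(all_subsets({0, 1, 2}), inclusion)
antichain_directed = is_directed([frozenset({0}), frozenset({1})], inclusion)
```

The subsets of {0, 1, 2} under inclusion pass the test, since the union of two subsets is an upper bound for both. The two-element collection {0}, {1} fails, because neither set contains the other and no third set is available.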

1. Show that the following are directed sets:

(a) Any simply ordered set, under the relation ≤.

(b) The collection of all subsets of a set S, partially ordered by inclusion (that
is, A ≼ B if A ⊂ B).

(c) A collection 𝒜 of subsets of S that is closed under finite intersections,
partially ordered by reverse inclusion (that is, A ≼ B if A ⊃ B).

(d) The collection of all closed subsets of a space X, partially ordered by
inclusion.

2. A subset K of J is said to be cofinal in J if for each α ∈ J, there exists β ∈ K
such that α ≼ β. Show that if J is a directed set and K is cofinal in J, then K is
a directed set.

3. Let X be a topological space. A net in X is a function f from a directed set J
into X. If α ∈ J, we usually denote f(α) by x_α. We denote the net f itself by
the symbol (x_α)_{α ∈ J}, or merely by (x_α) if the index set is understood.

The net (x_α) is said to converge to the point x of X (written x_α → x) if for
each neighborhood U of x, there exists α ∈ J such that


α ≼ β ⟹ x_β ∈ U.

Show that these definitions reduce to familiar ones when J = Z_+.

4. Suppose that


(x_α)_{α ∈ J} → x in X and (y_α)_{α ∈ J} → y in Y.


Show that (x_α × y_α) → x × y in X × Y.

5. Show that if X is Hausdorff, a net in X converges to at most one point.

6. Theorem. Let A ⊂ X. Then x ∈ Ā if and only if there is a net of points of A
converging to x.
[Hint: To prove the implication ⇒, take as index set the collection of all
neighborhoods of x, partially ordered by reverse inclusion.]

7. Theorem. Let f : X → Y. Then f is continuous if and only if for every
convergent net (x_α) in X, converging to x, say, the net (f(x_α)) converges to f(x).

8. Let f : J → X be a net in X; let f(α) = x_α. If K is a directed set and
g : K → J is a function such that

(i) i ≼ j ⟹ g(i) ≼ g(j),

(ii) g(K) is cofinal in J,

then the composite function f ∘ g : K → X is called a subnet of (x_α). Show
that if the net (x_α) converges to x, so does any subnet.

9. Let (x_α)_{α ∈ J} be a net in X. We say that x is an accumulation point of the net (x_α)
if for each neighborhood U of x, the set of those α for which x_α ∈ U is cofinal
in J.
Lemma. The net (x_α) has the point x as an accumulation point if and only if
some subnet of (x_α) converges to x.
[Hint: To prove the implication ⇒, let K be the set of all pairs (α, U) where
α ∈ J and U is a neighborhood of x containing x_α. Define (α, U) ≼ (β, V) if
α ≼ β and V ⊂ U. Show that K is a directed set and use it to define the subnet.]

10. Theorem. X is compact if and only if every net in X has a convergent subnet.
[Hint: To prove the implication ⇒, let B_α = {x_β | α ≼ β} and show that
{B_α} has the finite intersection property. To prove ⇐, let 𝒜 be a collection of
closed sets having the finite intersection property, and let ℬ be the collection of
all finite intersections of elements of 𝒜, partially ordered by reverse inclusion.]

11. Corollary. Let G be a topological group; let A and B be subsets of G. If A is
closed in G and B is compact, then A · B is closed in G.
[Hint: First give a proof using sequences, assuming that G is metrizable.]

12. Check that the preceding exercises remain correct if condition (2) is omitted from
the definition of directed set. Many mathematicians use the term “directed set”
in this more general sense.


Chapter 4 


Countability and Separation Axioms


The concepts we are going to introduce now, unlike compactness and connectedness, 
do not arise naturally from the study of calculus and analysis. They arise instead from a 
deeper study of topology itself. Such problems as imbedding a given space in a metric 
space or in a compact Hausdorff space are basically problems of topology rather than 
analysis. These particular problems have solutions that involve the countability and 
separation axioms. 

We have already introduced the first countability axiom; it arose in connection with 
our study of convergent sequences in §21. We have also studied one of the separation 
axioms (the Hausdorff axiom) and mentioned another (the $T_1$ axiom). In this chapter
we shall introduce other, and stronger, axioms like these and explore some of their 
consequences. Our basic goal is to prove the Urysohn metrization theorem. It says 
that if a topological space X satisfies a certain countability axiom (the second) and a 
certain separation axiom (the regularity axiom), then X can be imbedded in a metric 
space and is thus metrizable. 

Another imbedding theorem, important to geometers, appears in the last section
of the chapter. Given a space that is a compact manifold (the higher-dimensional
analogue of a surface), we show that it can be imbedded in some finite-dimensional
euclidean space.

§30 The Countability Axioms


Recall the definition we gave in §21. 


Definition. A space X is said to have a countable basis at x if there is a countable 
collection B of neighborhoods of x such that each neighborhood of x contains at least 
one of the elements of B. A space that has a countable basis at each of its points is 
said to satisfy the first countability axiom, or to be first-countable. 


We have already noted that every metrizable space satisfies this axiom; see §21.

The most useful fact concerning spaces that satisfy this axiom is the fact that in 
such a space, convergent sequences are adequate to detect limit points of sets and to 
check continuity of functions. We have noted this before; now we state it formally as 
a theorem: 


Theorem 30.1. Let X be a topological space. 
(a) Let A be a subset of X. If there is a sequence of points of A converging to x,
then $x \in \bar{A}$; the converse holds if X is first-countable.
(b) Let $f : X \to Y$. If f is continuous, then for every convergent sequence $x_n \to x$
in X, the sequence $f(x_n)$ converges to $f(x)$. The converse holds if X is first-countable.


The proof is a direct generalization of the proof given in §21 under the hypothesis
of metrizability, so it will not be repeated here.
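For readers who want the omitted argument spelled out, the following is a sketch of the first-countable direction of part (a). It assumes that the nested-neighborhood construction used in the metrizable case carries over unchanged; the names $B_n$ and $U_n$ are introduced here only for illustration.

```latex
% Sketch: if X is first-countable and x lies in the closure of A,
% then some sequence of points of A converges to x.
Let $\{B_n\}_{n \in \mathbb{Z}_+}$ be a countable basis at $x$, and set
\[
  U_n = B_1 \cap B_2 \cap \cdots \cap B_n , \qquad\text{so that}\qquad
  U_1 \supset U_2 \supset \cdots .
\]
Each $U_n$ is a neighborhood of $x$, and $x \in \bar{A}$, so $U_n \cap A \neq \varnothing$;
choose $x_n \in U_n \cap A$. Given a neighborhood $U$ of $x$, pick $N$ with $B_N \subset U$. Then
\[
  n \ge N \implies x_n \in U_n \subset U_N \subset B_N \subset U ,
\]
so $x_n \to x$.
```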
Of much greater importance than the first countability axiom is the following: 


Definition. If a space X has a countable basis for its topology, then X is said to 
satisfy the second countability axiom, or to be second-countable. 


Obviously, the second axiom implies the first: if B is a countable basis for the 
topology of X, then the subset of B consisting of those basis elements containing the 
point x is a countable basis at x. The second axiom is, in fact, much stronger than the 
first; it is so strong that not even every metric space satisfies it.

Why then is this second axiom interesting? Well, for one thing, many familiar 
spaces do satisfy it. For another, it is a crucial hypothesis used in proving such theo- 
rems as the Urysohn metrization theorem, as we shall see. 


EXAMPLE 1. The real line $\mathbb{R}$ has a countable basis: the collection of all open intervals
(a, b) with rational end points. Likewise, $\mathbb{R}^n$ has a countable basis: the collection of
all products of intervals having rational end points. Even $\mathbb{R}^\omega$ has a countable basis: the
collection of all products $\prod_{n \in \mathbb{Z}_+} U_n$, where $U_n$ is an open interval with rational end points
for finitely many values of n, and $U_n = \mathbb{R}$ for all other values of n.


EXAMPLE 2. In the uniform topology, $\mathbb{R}^\omega$ satisfies the first countability axiom (being
metrizable). However, it does not satisfy the second. To verify this fact, we first show that
if X is a space having a countable basis $\mathcal{B}$, then any discrete subspace A of X must be
countable. Choose, for each $a \in A$, a basis element $B_a$ that intersects A in the point a
alone. If a and b are distinct points of A, the sets $B_a$ and $B_b$ are different, since the first
contains a and the second does not. It follows that the map $a \mapsto B_a$ is an injection of A
into $\mathcal{B}$, so A must be countable.

Now we note that the subspace A of $\mathbb{R}^\omega$ consisting of all sequences of 0's and 1's is
uncountable; and it has the discrete topology because $\bar{\rho}(a, b) = 1$ for any two distinct
points a and b of A. Therefore, in the uniform topology $\mathbb{R}^\omega$ does not have a countable
basis.
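The distance computation behind the last step can be written out as follows. This is a sketch that assumes the uniform metric $\bar{\rho}$ is the supremum of the coordinatewise standard bounded distances, as defined earlier in the book; the index k is introduced here for illustration.

```latex
% Assumed definition of the uniform metric:
%   \bar{\rho}(a,b) = \sup_i \min\{ |a_i - b_i|, 1 \}.
Let $a = (a_i)$ and $b = (b_i)$ be distinct sequences of $0$'s and $1$'s.
Every coordinate satisfies $|a_i - b_i| \in \{0, 1\}$, and $a_k \neq b_k$
for at least one index $k$, so
\[
  \bar{\rho}(a, b) = \sup_{i} \min\{\, |a_i - b_i|,\ 1 \,\} = |a_k - b_k| = 1 .
\]
Hence the ball $B_{\bar{\rho}}(a, 1)$ meets $A$ in the point $a$ alone,
which is why $A$ is discrete.
```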


Both countability axioms are well behaved with respect to the operations of taking 
subspaces or countable products: 


Theorem 30.2. A subspace of a first-countable space is first-countable, and a count- 
able product of first-countable spaces is first-countable. A subspace of a second- 
countable space is second-countable, and a countable product of second-countable 
spaces is second-countable. 


Proof. Consider the second countability axiom. If $\mathcal{B}$ is a countable basis for X, then
$\{B \cap A \mid B \in \mathcal{B}\}$ is a countable basis for the subspace A of X. If $\mathcal{B}_i$ is a countable
basis for the space $X_i$, then the collection of all products $\prod U_i$, where $U_i \in \mathcal{B}_i$ for
finitely many values of i and $U_i = X_i$ for all other values of i, is a countable basis for
$\prod X_i$.

The proof for the first countability axiom is similar. ∎
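The proof takes for granted that this collection of products is countable. One way to see it is sketched here, under the assumption that the spaces are indexed by $i \in \mathbb{Z}_+$; the symbols $F$ and $\mathcal{C}_F$ are introduced only for this illustration.

```latex
For each finite set $F \subset \mathbb{Z}_+$, let
\[
  \mathcal{C}_F = \Bigl\{ \prod_{i} U_i \;\Bigm|\;
     U_i \in \mathcal{B}_i \text{ for } i \in F,\
     U_i = X_i \text{ for } i \notin F \Bigr\} .
\]
The rule $(U_i)_{i \in F} \mapsto \prod_i U_i$ maps the finite product
$\prod_{i \in F} \mathcal{B}_i$ of countable sets onto $\mathcal{C}_F$,
so $\mathcal{C}_F$ is countable. The basis in the proof is
\[
  \bigcup_{F \subset \mathbb{Z}_+,\ F \text{ finite}} \mathcal{C}_F ,
\]
a countable union of countable sets, because $\mathbb{Z}_+$ has only
countably many finite subsets.
```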


Two consequences of the second countability axiom that will be useful to us later 
are given in the following theorem. First, a definition: 


Definition. A subset A of a space X is said to be dense in X if $\bar{A} = X$.


Theorem 30.3. Suppose that X has a countable basis. Then: 
(a) Every open covering of X contains a countable subcollection covering X. 
(b) There exists a countable subset of X that is dense in X.


Proof. Let $\{B_n\}$ be a countable basis for X.

(a) Let $\mathcal{A}$ be an open covering of X. For each positive integer n for which it is possible,
choose an element $A_n$ of $\mathcal{A}$ containing the basis element $B_n$. The collection $\mathcal{A}'$
of the sets $A_n$ is countable, since it is indexed with a subset J of the positive integers.
Furthermore, it covers X: Given a point $x \in X$, we can choose an element A of $\mathcal{A}$
containing x. Since A is open, there is a basis element $B_n$ such that $x \in B_n \subset A$.
Because $B_n$ lies in an element of $\mathcal{A}$, the index n belongs to the set J, so $A_n$ is defined;
since $A_n$ contains $B_n$, it contains x. Thus $\mathcal{A}'$ is a countable subcollection of $\mathcal{A}$ that
covers X.

(b) From each nonempty basis element $B_n$, choose a point $x_n$. Let D be the set
consisting of the points $x_n$. Then D is dense in X: Given any point x of X, every basis
element containing x intersects D, so x belongs to $\bar{D}$. ∎

The two properties listed in Theorem 30.3 are sometimes taken as alternative
countability axioms. A space for which every open covering contains a countable
subcovering is called a Lindelöf space. A space having a countable dense subset is
often said to be separable (an unfortunate choice of terminology).† Weaker in general
than the second countability axiom, each of these properties is equivalent to the second
countability axiom when the space is metrizable (see Exercise 5). They are less important
than the second countability axiom, but you should be aware of their existence, for
they are sometimes useful. It is often easier, for instance, to show that a space X has a
countable dense subset than it is to show that X has a countable basis. If the space is
metrizable (as it usually is in analysis), it follows that X is second-countable as well.

We shall not use these properties to prove any theorems, but one of them, the
Lindelöf condition, will be useful in dealing with some examples. They are not as
well behaved as one might wish under the operations of taking subspaces and cartesian 
products, as we shall see in the examples and exercises that follow. 


EXAMPLE 3. The space $\mathbb{R}_\ell$ satisfies all the countability axioms but the second.

Given $x \in \mathbb{R}_\ell$, the set of all basis elements of the form $[x, x + 1/n)$ is a countable
basis at x. And it is easy to see that the rational numbers are dense in $\mathbb{R}_\ell$.

To see that $\mathbb{R}_\ell$ has no countable basis, let $\mathcal{B}$ be a basis for $\mathbb{R}_\ell$. Choose for each x an
element $B_x$ of $\mathcal{B}$ such that $x \in B_x \subset [x, x + 1)$. If $x \neq y$, then $B_x \neq B_y$, since $x = \inf B_x$
and $y = \inf B_y$. Therefore, $\mathcal{B}$ must be uncountable.

To show that $\mathbb{R}_\ell$ is Lindelöf requires more work. It will suffice to show that every open
covering of $\mathbb{R}_\ell$ by basis elements contains a countable subcollection covering $\mathbb{R}_\ell$. (You can
check this.) So let

$$\mathcal{A} = \{[a_\alpha, b_\alpha)\}_{\alpha \in J}$$

be a covering of $\mathbb{R}$ by basis elements for the lower limit topology. We wish to find a
countable subcollection that covers $\mathbb{R}$.
Let C be the set

$$C = \bigcup_{\alpha \in J} (a_\alpha, b_\alpha),$$

which is a subset of $\mathbb{R}$. We show the set $\mathbb{R} - C$ is countable.

Let x be a point of $\mathbb{R} - C$. We know that x belongs to no open interval $(a_\alpha, b_\alpha)$;
therefore $x = a_\beta$ for some index $\beta$. Choose such a $\beta$ and then choose $q_x$ to be a rational
number belonging to the interval $(a_\beta, b_\beta)$. Because $(a_\beta, b_\beta)$ is contained in C, so is the
interval $(a_\beta, q_x) = (x, q_x)$. It follows that if x and y are two points of $\mathbb{R} - C$ with $x < y$,
then $q_x < q_y$. (For otherwise, we would have $x < y < q_y \le q_x$, so that y would lie in the
interval $(x, q_x)$ and hence in C.) Therefore the map $x \mapsto q_x$ of $\mathbb{R} - C$ into $\mathbb{Q}$ is injective,
so that $\mathbb{R} - C$ is countable.

Now we show that some countable subcollection of $\mathcal{A}$ covers $\mathbb{R}$. To begin, choose for
each element of $\mathbb{R} - C$ an element of $\mathcal{A}$ containing it; one obtains a countable subcollection
$\mathcal{A}'$ of $\mathcal{A}$ that covers $\mathbb{R} - C$. Now take the set C and topologize it as a subspace of $\mathbb{R}$;
in this topology, C satisfies the second countability axiom. Now C is covered by the sets
$(a_\alpha, b_\alpha)$, which are open in $\mathbb{R}$ and hence open in C. Then some countable subcollection
covers C. Suppose this subcollection consists of the elements $(a_\alpha, b_\alpha)$ for $\alpha = \alpha_1, \alpha_2, \ldots$.
Then the collection

$$\mathcal{A}'' = \{[a_\alpha, b_\alpha) \mid \alpha = \alpha_1, \alpha_2, \ldots\}$$

is a countable subcollection of $\mathcal{A}$ that covers the set C, and $\mathcal{A}' \cup \mathcal{A}''$ is a countable
subcollection of $\mathcal{A}$ that covers $\mathbb{R}_\ell$.

† This is a good example of how a word can be overused. We have already defined what we mean
by a separation of a space; and we shall discuss the separation axioms shortly.


EXAMPLE 4. The product of two Lindelöf spaces need not be Lindelöf. Although the
space $\mathbb{R}_\ell$ is Lindelöf, we shall show that the product space $\mathbb{R}_\ell \times \mathbb{R}_\ell = \mathbb{R}_\ell^2$ is not. The space
$\mathbb{R}_\ell^2$ is an extremely useful example in topology called the Sorgenfrey plane.

The space $\mathbb{R}_\ell^2$ has as basis all sets of the form $[a, b) \times [c, d)$. To show it is not Lindelöf,
consider the subspace

$$L = \{x \times (-x) \mid x \in \mathbb{R}_\ell\}.$$

It is easy to check that L is closed in $\mathbb{R}_\ell^2$. Let us cover $\mathbb{R}_\ell^2$ by the open set $\mathbb{R}_\ell^2 - L$ and by
all basis elements of the form

$$[a, b) \times [-a, d).$$

Each of these open sets intersects L in at most one point. Since L is uncountable, no
countable subcollection covers $\mathbb{R}_\ell^2$. See Figure 30.1.

[Figure 30.1: a basis element $[a, b) \times [-a, d)$]
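The claim that each basis element of this form meets L in at most one point can be checked directly. The following sketch assumes $a < b$ and $-a < d$, so that the basis element is nonempty.

```latex
A point of $L$ has the form $x \times (-x)$. It lies in
$[a, b) \times [-a, d)$ exactly when
\[
  a \le x < b \qquad\text{and}\qquad -a \le -x < d .
\]
The inequality $-a \le -x$ says $x \le a$, and together with $a \le x$
this forces $x = a$. Conversely, $a \times (-a)$ satisfies both conditions
when $a < b$ and $-a < d$. Hence
\[
  L \cap \bigl( [a, b) \times [-a, d) \bigr) = \{\, a \times (-a) \,\} .
\]
```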


EXAMPLE 5. A subspace of a Lindelöf space need not be Lindelöf. The ordered square $I_o^2$
is compact; therefore it is Lindelöf, trivially. However, the subspace $A = I \times (0, 1)$ is not
Lindelöf. For A is the union of the disjoint sets $U_x = \{x\} \times (0, 1)$, each of which is open
in A. This collection of sets is uncountable, and no proper subcollection covers A.

Exercises

1. (a) A $G_\delta$ set in a space X is a set A that equals a countable intersection of open
sets of X. Show that in a first-countable $T_1$ space, every one-point set is a
$G_\delta$ set.
(b) There is a familiar space in which every one-point set is a $G_\delta$ set, which
nevertheless does not satisfy the first countability axiom. What is it?
The terminology here comes from the German. The “G” stands for “Gebiet,”
which means “open set,” and the “$\delta$” for “Durchschnitt,” which means “intersection.”

2. Show that if X has a countable basis $\{B_n\}$, then every basis $\mathcal{C}$ for X contains
a countable basis for X. [Hint: For every pair of indices n, m for which it is
possible, choose $C_{n,m} \in \mathcal{C}$ such that $B_n \subset C_{n,m} \subset B_m$.]

3. Let X have a countable basis; let A be an uncountable subset of X. Show that
uncountably many points of A are limit points of A.

4. Show that every compact metrizable space X has a countable basis. [Hint:
Let $\mathcal{A}_n$ be a finite covering of X by 1/n-balls.]

5. (a) Show that every metrizable space with a countable dense subset has a countable basis.
(b) Show that every metrizable Lindelöf space has a countable basis.

6. Show that $\mathbb{R}_\ell$ and $I_o^2$ are not metrizable.

7. Which of our four countability axioms does $S_\Omega$ satisfy? What about $\bar{S}_\Omega$?

8. Which of our four countability axioms does $\mathbb{R}^\omega$ in the uniform topology satisfy?

9. Let A be a closed subspace of X. Show that if X is Lindelöf, then A is Lindelöf.
Show by example that if X has a countable dense subset, A need not have a
countable dense subset.


10. Show that if X is a countable product of spaces having countable dense subsets,
then X has a countable dense subset.

11. Let $f : X \to Y$ be continuous. Show that if X is Lindelöf, or if X has a
countable dense subset, then $f(X)$ satisfies the same condition.

12. Let $f : X \to Y$ be a continuous open map. Show that if X satisfies the first or
the second countability axiom, then $f(X)$ satisfies the same axiom.

13. Show that if X has a countable dense subset, every collection of disjoint open
sets in X is countable.

14. Show that if X is Lindelöf and Y is compact, then $X \times Y$ is Lindelöf.

15. Give $\mathbb{R}^I$ the uniform metric, where $I = [0, 1]$. Let $\mathcal{C}(I, \mathbb{R})$ be the subspace consisting
of continuous functions. Show that $\mathcal{C}(I, \mathbb{R})$ has a countable dense subset,
and therefore a countable basis. [Hint: Consider those continuous functions
whose graphs consist of finitely many line segments with rational end points.]

16. (a) Show that the product space $\mathbb{R}^I$, where $I = [0, 1]$, has a countable dense
subset.
(b) Show that if J has cardinality greater than $\mathcal{P}(\mathbb{Z}_+)$, then the product space $\mathbb{R}^J$
does not have a countable dense subset. [Hint: If D is dense in $\mathbb{R}^J$, define
$f : J \to \mathcal{P}(D)$ by the equation $f(\alpha) = D \cap \pi_\alpha^{-1}((a, b))$, where (a, b) is a
fixed interval in $\mathbb{R}$.]

*17. Give $\mathbb{R}^\omega$ the box topology. Let $\mathbb{Q}^\infty$ denote the subspace consisting of sequences
of rationals that end in an infinite string of 0's. Which of our four countability
axioms does this space satisfy?

*18. Let G be a first-countable topological group. Show that if G has a countable
dense subset, or is Lindelöf, then G has a countable basis. [Hint: Let $\{B_n\}$ be a
countable basis at e. If D is a countable dense subset of G, show the sets $dB_n$,
for $d \in D$, form a basis for G. If G is Lindelöf, choose for each n a countable set
$C_n$ such that the sets $cB_n$, for $c \in C_n$, cover G. Show that as n ranges over $\mathbb{Z}_+$,
these sets form a basis for G.]


§31 The Separation Axioms 


In this section, we introduce three separation axioms and explore some of their prop- 
erties. One you have already seen—the Hausdorff axiom. The others are similar but 
stronger. As always when we introduce new concepts, we shall examine the relation- 
ship between these axioms and the concepts introduced earlier in the book. 

Recall that a space X is said to be Hausdorff if for each pair x, y of distinct points 
of X, there exist disjoint open sets containing x and y, respectively. 


Definition. Suppose that one-point sets are closed in X. Then X is said to be regular
if for each pair consisting of a point x and a closed set B disjoint from x, there
exist disjoint open sets containing x and B, respectively. The space X is said to be 
normal if for each pair A, B of disjoint closed sets of X, there exist disjoint open sets 
containing A and B, respectively. 


It is clear that a regular space is Hausdorff, and that a normal space is regular. 
(We need to include the condition that one-point sets be closed as part of the definition 
of regularity and normality in order for this to be the case. A two-point space in the 
indiscrete topology satisfies the other part of the definitions of regularity and normality, 
even though it is not Hausdorff.) For examples showing the regularity axiom stronger
than the Hausdorff axiom, and normality stronger than regularity, see Examples 1 
and 3. 

These axioms are called separation axioms for the reason that they involve “sepa- 
rating” certain kinds of sets from one another by disjoint open sets. We have used the 
word “separation” before, of course, when we studied connected spaces. But in that 
case, we were trying to find disjoint open sets whose union was the entire space.
The present situation is quite different because the open sets need not satisfy this
condition.

[Figure 31.1: Hausdorff, Regular, Normal]

The three separation axioms are illustrated in Figure 31.1.
There are other ways to formulate the separation axioms. One formulation that is 
sometimes useful is given in the following lemma: 


Lemma 31.1. Let X be a topological space. Let one-point sets in X be closed. 

(a) X is regular if and only if given a point x of X and a neighborhood U of x,
there is a neighborhood V of x such that $\bar{V} \subset U$.

(b) X is normal if and only if given a closed set A and an open set U containing A,
there is an open set V containing A such that $\bar{V} \subset U$.


Proof. (a) Suppose that X is regular, and suppose that the point x and the neighborhood
U of x are given. Let $B = X - U$; then B is a closed set. By hypothesis, there
exist disjoint open sets V and W containing x and B, respectively. The set $\bar{V}$ is disjoint
from B, since if $y \in B$, the set W is a neighborhood of y disjoint from V. Therefore,
$\bar{V} \subset U$, as desired.

To prove the converse, suppose the point x and the closed set B not containing x
are given. Let $U = X - B$. By hypothesis, there is a neighborhood V of x such
that $\bar{V} \subset U$. The open sets V and $X - \bar{V}$ are disjoint open sets containing x and B,
respectively. Thus X is regular.

(b) This proof uses exactly the same argument; one just replaces the point x by the
set A throughout. ∎
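For reference, here is part (b) written out by making that replacement in the argument for (a). It is offered as a sketch of the substitution rather than as additional material; the set W plays the same role as before.

```latex
% Part (b), obtained from part (a) by replacing the point x with the closed set A.
($\Rightarrow$) Let $X$ be normal, let $A$ be closed, and let $U$ be open
with $A \subset U$. Put $B = X - U$, a closed set disjoint from $A$.
Choose disjoint open sets $V \supset A$ and $W \supset B$.
Every $y \in B$ has the neighborhood $W$ disjoint from $V$, so $y \notin \bar{V}$. Hence
\[
  \bar{V} \cap B = \varnothing , \qquad\text{that is,}\qquad
  \bar{V} \subset X - B = U .
\]
($\Leftarrow$) Let $A$ and $B$ be disjoint closed sets, and put $U = X - B$,
an open set containing $A$. By hypothesis there is an open set $V \supset A$
with $\bar{V} \subset U$. Then
\[
  A \subset V , \qquad B = X - U \subset X - \bar{V} , \qquad
  V \cap (X - \bar{V}) = \varnothing ,
\]
so $V$ and $X - \bar{V}$ are disjoint open sets containing $A$ and $B$, respectively.
```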


Now we relate the separation axioms with the concepts previously introduced. 


Theorem 31.2. (a) A subspace of a Hausdorff space is Hausdorff; a product of Hausdorff
spaces is Hausdorff.
(b) A subspace of a regular space is regular; a product of regular spaces is regular.

Proof. (a) This result was an exercise in §17. We provide a proof here. Let X be
Hausdorff. Let x and y be two points of the subspace Y of X. If U and V are disjoint
neighborhoods in X of x and y, respectively, then $U \cap Y$ and $V \cap Y$ are disjoint
neighborhoods of x and y in Y.

Let $\{X_\alpha\}$ be a family of Hausdorff spaces. Let $x = (x_\alpha)$ and $y = (y_\alpha)$ be distinct
points of the product space $\prod X_\alpha$. Because $x \neq y$, there is some index $\beta$ such that
$x_\beta \neq y_\beta$. Choose disjoint open sets U and V in $X_\beta$ containing $x_\beta$ and $y_\beta$, respectively.
Then the sets $\pi_\beta^{-1}(U)$ and $\pi_\beta^{-1}(V)$ are disjoint open sets in $\prod X_\alpha$ containing x and y,
respectively.

(b) Let Y be a subspace of the regular space X. Then one-point sets are closed
in Y. Let x be a point of Y and let B be a closed subset of Y disjoint from x. Now
$\bar{B} \cap Y = B$, where $\bar{B}$ denotes the closure of B in X. Therefore, $x \notin \bar{B}$, so, using
regularity of X, we can choose disjoint open sets U and V of X containing x and $\bar{B}$,
respectively. Then $U \cap Y$ and $V \cap Y$ are disjoint open sets in Y containing x and B,
respectively.

Let $\{X_\alpha\}$ be a family of regular spaces; let $X = \prod X_\alpha$. By (a), X is Hausdorff, so
that one-point sets are closed in X. We use the preceding lemma to prove regularity
of X. Let $x = (x_\alpha)$ be a point of X and let U be a neighborhood of x in X. Choose a
basis element $\prod U_\alpha$ about x contained in U. Choose, for each $\alpha$, a neighborhood $V_\alpha$
of $x_\alpha$ in $X_\alpha$ such that $\bar{V}_\alpha \subset U_\alpha$; if it happens that $U_\alpha = X_\alpha$, choose $V_\alpha = X_\alpha$. Then
$V = \prod V_\alpha$ is a neighborhood of x in X. Since $\bar{V} = \prod \bar{V}_\alpha$ by Theorem 19.5, it follows
at once that $\bar{V} \subset \prod U_\alpha \subset U$, so that X is regular. ∎


There is no analogous theorem for normal spaces, as we shall see shortly, in this 
section and the next. 


EXAMPLE 1. The space $\mathbb{R}_K$ is Hausdorff but not regular. Recall that $\mathbb{R}_K$ denotes the reals
in the topology having as basis all open intervals (a, b) and all sets of the form $(a, b) - K$,
where $K = \{1/n \mid n \in \mathbb{Z}_+\}$. This space is Hausdorff, because any two distinct points have
disjoint open intervals containing them.

But it is not regular. The set K is closed in $\mathbb{R}_K$, and it does not contain the point 0.
Suppose that there exist disjoint open sets U and V containing 0 and K, respectively.
Choose a basis element containing 0 and lying in U. It must be a basis element of the form
$(a, b) - K$, since each basis element of the form (a, b) containing 0 intersects K. Choose n
large enough that $1/n \in (a, b)$. Then choose a basis element about 1/n contained in V;
it must be a basis element of the form (c, d). Finally, choose z so that $z < 1/n$ and
$z > \max\{c, 1/(n + 1)\}$. Then z belongs to both U and V, so they are not disjoint. See
Figure 31.2.

[Figure 31.2]

EXAMPLE 2. The space $\mathbb{R}_\ell$ is normal. It is immediate that one-point sets are closed
in $\mathbb{R}_\ell$, since the topology of $\mathbb{R}_\ell$ is finer than that of $\mathbb{R}$. To check normality, suppose that A
and B are disjoint closed sets in $\mathbb{R}_\ell$. For each point a of A choose a basis element $[a, x_a)$ not
intersecting B, and for each point b of B choose a basis element $[b, x_b)$ not intersecting A.
The open sets

$$U = \bigcup_{a \in A} [a, x_a) \quad\text{and}\quad V = \bigcup_{b \in B} [b, x_b)$$

are disjoint open sets about A and B, respectively.
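The disjointness of U and V is asserted without proof. One way to check it is given here as a sketch; the point z is introduced only for this argument.

```latex
Suppose $z \in [a, x_a) \cap [b, x_b)$ for some $a \in A$ and $b \in B$.
Since $A \cap B = \varnothing$, we have $a \neq b$; say $a < b$. Then
\[
  a < b \le z < x_a \implies b \in [a, x_a) ,
\]
contradicting the choice of $[a, x_a)$ as a set not intersecting $B$.
If instead $b < a$, the same computation gives $a \in [b, x_b)$,
contradicting the choice of $[b, x_b)$. Hence $U \cap V = \varnothing$.
```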


EXAMPLE 3. The Sorgenfrey plane $\mathbb{R}_\ell^2$ is not normal.

The space $\mathbb{R}_\ell$ is regular (in fact, normal), so the product space $\mathbb{R}_\ell^2$ is also regular. Thus
this example serves two purposes. It shows that a regular space need not be normal, and it
shows that the product of two normal spaces need not be normal.

We suppose $\mathbb{R}_\ell^2$ is normal and derive a contradiction. Let L be the subspace of $\mathbb{R}_\ell^2$
consisting of all points of the form $x \times (-x)$. Then L is closed in $\mathbb{R}_\ell^2$, and L has the
discrete topology. Hence every subset A of L, being closed in L, is closed in $\mathbb{R}_\ell^2$. Because
$L - A$ is also closed in $\mathbb{R}_\ell^2$, this means that for every nonempty proper subset A of L, one
can find disjoint open sets $U_A$ and $V_A$ containing A and $L - A$, respectively.

Let D denote the set of points of $\mathbb{R}_\ell^2$ having rational coordinates; it is dense in $\mathbb{R}_\ell^2$. We
define a map $\theta$ that assigns, to each subset of the line L, a subset of the set D, by setting

$$\theta(A) = D \cap U_A \quad\text{if } \varnothing \subsetneq A \subsetneq L,$$
$$\theta(\varnothing) = \varnothing,$$
$$\theta(L) = D.$$

We show that $\theta : \mathcal{P}(L) \to \mathcal{P}(D)$ is injective.

Let A be a proper nonempty subset of L. Then $\theta(A) = D \cap U_A$ is neither empty (since
$U_A$ is open and D is dense in $\mathbb{R}_\ell^2$) nor all of D (since $D \cap V_A$ is nonempty). It remains to
show that if B is another proper nonempty subset of L, then $\theta(A) \neq \theta(B)$.

One of the sets A, B contains a point not in the other; suppose that $x \in A$ and $x \notin B$.
Then $x \in L - B$, so that $x \in U_A \cap V_B$; since the latter set is open and nonempty, it must
contain points of D. These points belong to $U_A$ and not to $U_B$; therefore, $D \cap U_A \neq D \cap U_B$,
as desired. Thus $\theta$ is injective.

Now we show there exists an injective map $\phi : \mathcal{P}(D) \to L$. Because D is countably
infinite and L has the cardinality of $\mathbb{R}$, it suffices to define an injective map $\psi$ of $\mathcal{P}(\mathbb{Z}_+)$
into $\mathbb{R}$. For that, we let $\psi$ assign to the subset S of $\mathbb{Z}_+$ the infinite decimal $.a_1 a_2 \ldots$, where
$a_i = 0$ if $i \in S$ and $a_i = 1$ if $i \notin S$. That is,

$$\psi(S) = \sum_{i=1}^{\infty} a_i / 10^i.$$

Now the composite

$$\mathcal{P}(L) \xrightarrow{\ \theta\ } \mathcal{P}(D) \xrightarrow{\ \phi\ } L$$

is an injective map of $\mathcal{P}(L)$ into L. But Theorem 7.8 tells us such a map does not exist!
Thus we have reached a contradiction.

This proof that $\mathbb{R}_\ell^2$ is not normal is in some ways not very satisfying. We showed
only that there must exist some proper nonempty subset A of L such that the sets A and
$B = L - A$ are not contained in disjoint open sets of $\mathbb{R}_\ell^2$. But we did not actually find such
a set A. In fact, the set A of points of L having rational coordinates is such a set, but the
proof is not easy. It is left to the exercises.


Exercises 


1. Show that if X is regular, every pair of points of X have neighborhoods whose
closures are disjoint.

2. Show that if X is normal, every pair of disjoint closed sets have neighborhoods
whose closures are disjoint.

3. Show that every order topology is regular.

4. Let X and X' denote a single set under two topologies $\mathcal{T}$ and $\mathcal{T}'$, respectively;
assume that $\mathcal{T}' \supset \mathcal{T}$. If one of the spaces is Hausdorff (or regular, or normal),
what does that imply about the other?

5. Let $f, g : X \to Y$ be continuous; assume that Y is Hausdorff. Show that
$\{x \mid f(x) = g(x)\}$ is closed in X.

6. Let $p : X \to Y$ be a closed continuous surjective map. Show that if X is normal,
then so is Y. [Hint: If U is an open set containing $p^{-1}(\{y\})$, show there is a
neighborhood W of y such that $p^{-1}(W) \subset U$.]


7. Let $p : X \to Y$ be a closed continuous surjective map such that $p^{-1}(\{y\})$ is
compact for each $y \in Y$. (Such a map is called a perfect map.)
(a) Show that if X is Hausdorff, then so is Y.
(b) Show that if X is regular, then so is Y.
(c) Show that if X is locally compact, then so is Y.
(d) Show that if X is second-countable, then so is Y. [Hint: Let $\mathcal{B}$ be a countable
basis for X. For each finite subset J of $\mathcal{B}$, let $U_J$ be the union of all sets of
the form $p^{-1}(W)$, for W open in Y, that are contained in the union of the
elements of J.]

8. Let X be a space; let G be a topological group. An action of G on X is a
continuous map $\alpha : G \times X \to X$ such that, denoting $\alpha(g \times x)$ by $g \cdot x$, one has:
(i) $e \cdot x = x$ for all $x \in X$.
(ii) $g_1 \cdot (g_2 \cdot x) = (g_1 \cdot g_2) \cdot x$ for all $x \in X$ and $g_1, g_2 \in G$.
Define $x \sim g \cdot x$ for all x and g; the resulting quotient space is denoted X/G and
called the orbit space of the action $\alpha$.
Theorem. Let G be a compact topological group; let X be a topological space;
let $\alpha$ be an action of G on X. If X is Hausdorff, or regular, or normal, or locally
compact, or second-countable, so is X/G.
[Hint: See Exercise 13 of §26.]

*9. Let A be the set of all points of $\mathbb{R}_\ell^2$ of the form $x \times (-x)$, for x rational; let B be
the set of all points of this form for x irrational. If V is an open set of $\mathbb{R}_\ell^2$ containing
B, show there exists no open set U containing A that is disjoint from V,
as follows:
(a) Let $K_n$ consist of all irrational numbers x in [0, 1] such that $[x, x + 1/n) \times
[-x, -x + 1/n)$ is contained in V. Show [0, 1] is the union of the sets $K_n$
and countably many one-point sets.
(b) Use Exercise 5 of §27 to show that some set $\bar{K}_n$ contains an open interval
(a, b) of $\mathbb{R}$.
(c) Show that V contains the open parallelogram consisting of all points of the
form $x \times (-x + \epsilon)$ for which $a < x < b$ and $0 < \epsilon < 1/n$.
(d) Conclude that if q is a rational number with $a < q < b$, then the point
$q \times (-q)$ of $\mathbb{R}_\ell^2$ is a limit point of V.


§32 Normal Spaces 


Now we turn to a more thorough study of spaces satisfying the normality axiom. In 
one sense, the term “normal” is something of a misnomer, for normal spaces are not as
well-behaved as one might wish. On the other hand, most of the spaces with which we 
are familiar do satisfy this axiom, as we shall see. Its importance comes from the fact 
that the results one can prove under the hypothesis of normality are central to much of 
topology. The Urysohn metrization theorem and the Tietze extension theorem are two 
such results; we shall deal with them later in this chapter.

We begin by proving three theorems that give three important sets of hypotheses 
under which normality of a space is assured. 


Theorem 32.1. Every regular space with a countable basis is normal. 


Proof. Let X be a regular space with a countable basis $\mathcal{B}$. Let A and B be disjoint
closed subsets of X. Each point x of A has a neighborhood U not intersecting B. Using
regularity, choose a neighborhood V of x whose closure lies in U; finally, choose an
element of $\mathcal{B}$ containing x and contained in V. By choosing such a basis element for
each x in A, we construct a countable covering of A by open sets whose closures do
not intersect B. Since this covering of A is countable, we can index it with the positive
integers; let us denote it by $\{U_n\}$.

Similarly, choose a countable collection $\{V_n\}$ of open sets covering B, such that
each set $\bar{V}_n$ is disjoint from A. The sets $U = \bigcup U_n$ and $V = \bigcup V_n$ are open sets containing
A and B, respectively, but they need not be disjoint. We perform the following
simple trick to construct two open sets that are disjoint. Given n, define

$$U_n' = U_n - \bigcup_{i=1}^{n} \bar{V}_i \quad\text{and}\quad V_n' = V_n - \bigcup_{i=1}^{n} \bar{U}_i.$$

Note that each set $U_n'$ is open, being the difference of an open set $U_n$ and a closed set
$\bigcup_{i=1}^{n} \bar{V}_i$. Similarly, each set $V_n'$ is open. The collection $\{U_n'\}$ covers A, because each
x in A belongs to $U_n$ for some n, and x belongs to none of the sets $\bar{V}_i$. Similarly, the
collection $\{V_n'\}$ covers B. See Figure 32.1.

[Figure 32.1]

Finally, the open sets

$$U' = \bigcup_{n \in \mathbb{Z}_+} U_n' \quad\text{and}\quad V' = \bigcup_{n \in \mathbb{Z}_+} V_n'$$

are disjoint. For if $x \in U' \cap V'$, then $x \in U_j' \cap V_k'$ for some j and k. Suppose that
$j \le k$. It follows from the definition of $U_j'$ that $x \in U_j$; and since $j \le k$ it follows
from the definition of $V_k'$ that $x \notin \bar{U}_j$. A similar contradiction arises if $j \ge k$. ∎

Theorem 32.2. Every metrizable space is normal.


Proof. Let X be a metrizable space with metric d. Let A and B be disjoint closed
subsets of X. For each $a \in A$, choose $\epsilon_a$ so that the ball $B(a, \epsilon_a)$ does not intersect B.
Similarly, for each b in B, choose $\epsilon_b$ so that the ball $B(b, \epsilon_b)$ does not intersect A.
Define

$$U = \bigcup_{a \in A} B(a, \epsilon_a/2) \quad\text{and}\quad V = \bigcup_{b \in B} B(b, \epsilon_b/2).$$

Then U and V are open sets containing A and B, respectively; we assert they are
disjoint. For if $z \in U \cap V$, then

$$z \in B(a, \epsilon_a/2) \cap B(b, \epsilon_b/2)$$

for some $a \in A$ and some $b \in B$. The triangle inequality applies to show that
$d(a, b) < (\epsilon_a + \epsilon_b)/2$. If $\epsilon_a \le \epsilon_b$, then $d(a, b) < \epsilon_b$, so that the ball $B(b, \epsilon_b)$
contains the point a. If $\epsilon_b \le \epsilon_a$, then $d(a, b) < \epsilon_a$, so that the ball $B(a, \epsilon_a)$ contains
the point b. Neither situation is possible. ∎
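Written out, the estimate in the last step is the following chain of inequalities. This is only a restatement of the argument above, with z the assumed common point of the two balls.

```latex
Since $z \in B(a, \epsilon_a/2) \cap B(b, \epsilon_b/2)$,
\[
  d(a, b) \le d(a, z) + d(z, b)
          < \frac{\epsilon_a}{2} + \frac{\epsilon_b}{2}
          = \frac{\epsilon_a + \epsilon_b}{2}
          \le \max\{\epsilon_a, \epsilon_b\} .
\]
If $\epsilon_a \le \epsilon_b$, the right side is $\epsilon_b$, so $a \in B(b, \epsilon_b)$,
which was chosen disjoint from $A$.
If $\epsilon_b \le \epsilon_a$, the right side is $\epsilon_a$, so $b \in B(a, \epsilon_a)$,
which was chosen disjoint from $B$.
```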


Theorem 32.3. Every compact Hausdorff space is normal. 


Proof. Let X be a compact Hausdorff space. We have already essentially proved
that X is regular. For if x is a point of X and B is a closed set in X not containing x,
then B is compact, so that Lemma 26.4 applies to show there exist disjoint open sets
about x and B, respectively.

Essentially the same argument as given in that lemma can be used to show that X
is normal: Given disjoint closed sets A and B in X, choose, for each point a of A,
disjoint open sets $U_a$ and $V_a$ containing a and B, respectively. (Here we use regularity
of X.) The collection $\{U_a\}$ covers A; because A is compact, A may be covered by
finitely many sets $U_{a_1}, \ldots, U_{a_m}$. Then

$$U = U_{a_1} \cup \cdots \cup U_{a_m} \quad\text{and}\quad V = V_{a_1} \cap \cdots \cap V_{a_m}$$

are disjoint open sets containing A and B, respectively. ∎
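That U and V are disjoint follows from a short computation, sketched here; the index i ranges over 1, ..., m.

```latex
For each $i$, the set $V$ is contained in $V_{a_i}$, and
$U_{a_i} \cap V_{a_i} = \varnothing$. Therefore
\[
  U \cap V = \bigcup_{i=1}^{m} \bigl( U_{a_i} \cap V \bigr)
           \subset \bigcup_{i=1}^{m} \bigl( U_{a_i} \cap V_{a_i} \bigr)
           = \varnothing .
\]
Also $B \subset V_{a_i}$ for every $i$, so $B \subset V$, and $V$ is open
because it is a finite intersection of open sets.
```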


Here is a further result about normality that we shall find useful in dealing with 
some examples. 


Theorem 32.4. Every well-ordered set X is normal in the order topology.


It is, in fact, true that every order topology is normal (see Example 39 of [S-S]);
but we shall not have occasion to use this stronger result.

Proof. Let X be a well-ordered set. We assert that every interval of the form $(x, y]$
is open in X. If X has a largest element and y is that element, $(x, y]$ is just a basis
element about y. If y is not the largest element of X, then $(x, y]$ equals the open set
$(x, y')$, where $y'$ is the immediate successor of y.

Now let A and B be disjoint closed sets in X; assume for the moment that neither A
nor B contains the smallest element $a_0$ of X. For each $a \in A$, there exists a basis
element about a disjoint from B; it contains some interval of the form $(x, a]$. (Here
is where we use the fact that a is not the smallest element of X.) Choose, for each
$a \in A$, such an interval $(x_a, a]$ disjoint from B. Similarly, for each $b \in B$, choose an
interval $(y_b, b]$ disjoint from A. The sets

$$U = \bigcup_{a \in A} (x_a, a] \quad\text{and}\quad V = \bigcup_{b \in B} (y_b, b]$$

are open sets containing A and B, respectively; we assert they are disjoint. For suppose
that $z \in U \cap V$. Then $z \in (x_a, a] \cap (y_b, b]$ for some $a \in A$ and some $b \in B$. Assume
that $a < b$. Then if $a \le y_b$, the two intervals are disjoint, while if $a > y_b$, we have
$a \in (y_b, b]$, contrary to the fact that $(y_b, b]$ is disjoint from A. A similar contradiction
occurs if $b < a$.

Finally, assume that A and B are disjoint closed sets in X, and A contains the
smallest element $a_0$ of X. The set $\{a_0\}$ is both open and closed in X. By the result of
the preceding paragraph, there exist disjoint open sets U and V containing the closed
sets $A - \{a_0\}$ and B, respectively. Then $U \cup \{a_0\}$ and V are disjoint open sets containing
A and B, respectively. ∎


EXAMPLE 1. If J is uncountable, the product space $\mathbb{R}^J$ is not normal. The proof is
fairly difficult; we leave it as a challenging exercise (see Exercise 9).

This example serves three purposes. It shows that a regular space $\mathbb{R}^J$ need not be
normal. It shows that a subspace of a normal space need not be normal, for $\mathbb{R}^J$ is
homeomorphic to the subspace $(0, 1)^J$ of $[0, 1]^J$, which (assuming the Tychonoff theorem) is
compact Hausdorff and therefore normal. And it shows that an uncountable product of
normal spaces need not be normal. It leaves unsettled the question as to whether a finite or
a countable product of normal spaces might be normal.


EXAMPLE 2. The product space $S_\Omega \times \bar{S}_\Omega$ is not normal.†

Consider the well-ordered set $\bar{S}_\Omega$, in the order topology, and consider the subset $S_\Omega$, in
the subspace topology (which is the same as the order topology). Both spaces are normal,
by Theorem 32.4. We shall show that the product space $S_\Omega \times \bar{S}_\Omega$ is not normal.

This example serves three purposes. First, it shows that a regular space need not be
normal, for $S_\Omega \times \bar{S}_\Omega$ is a product of regular spaces and therefore regular. Second, it shows
that a subspace of a normal space need not be normal, for $S_\Omega \times \bar{S}_\Omega$ is a subspace of $\bar{S}_\Omega \times \bar{S}_\Omega$,
which is a compact Hausdorff space and therefore normal. Third, it shows that the product
of two normal spaces need not be normal.

First, we consider the space $\bar{S}_\Omega \times \bar{S}_\Omega$ and its "diagonal" $\Delta = \{x \times x \mid x \in \bar{S}_\Omega\}$.
Because $\bar{S}_\Omega$ is Hausdorff, $\Delta$ is closed in $\bar{S}_\Omega \times \bar{S}_\Omega$: If U and V are disjoint neighborhoods
of x and y, respectively, then $U \times V$ is a neighborhood of $x \times y$ that does not intersect $\Delta$.

Therefore, in the subspace $S_\Omega \times \bar{S}_\Omega$, the set

$$A = \Delta \cap (S_\Omega \times \bar{S}_\Omega) = \Delta - \{\Omega \times \Omega\}$$

is closed. Likewise, the set

$$B = S_\Omega \times \{\Omega\}$$

is closed in $S_\Omega \times \bar{S}_\Omega$, being a "slice" of this product space. The sets A and B are disjoint.
We shall assume there exist disjoint open sets U and V of $S_\Omega \times \bar{S}_\Omega$ containing A and B,
respectively, and derive a contradiction. See Figure 32.2.


Figure 32.2


† Kelley [K] attributes this example to J. Dieudonné and A. P. Morse independently.

Given $x \in S_\Omega$, consider the vertical slice $x \times \bar{S}_\Omega$. We assert that there is some point $\beta$
with $x < \beta < \Omega$ such that $x \times \beta$ lies outside U. For if U contained all points $x \times \beta$ for
$x < \beta < \Omega$, then the top point $x \times \Omega$ of the slice would be a limit point of U, which it is
not because V is an open set disjoint from U containing this top point.

Choose $\beta(x)$ to be such a point; just to be definite, let $\beta(x)$ be the smallest element
of $S_\Omega$ such that $x < \beta(x) < \Omega$ and $x \times \beta(x)$ lies outside U. Define a sequence of points
of $S_\Omega$ as follows: Let $x_1$ be any point of $S_\Omega$. Let $x_2 = \beta(x_1)$, and in general, $x_{n+1} = \beta(x_n)$.
We have

$$x_1 < x_2 < \cdots,$$

because $\beta(x) > x$ for all x. The set $\{x_n\}$ is countable and therefore has an upper bound
in $S_\Omega$; let $b \in S_\Omega$ be its least upper bound. Because the sequence is increasing, it must
converge to its least upper bound; thus $x_n \to b$. But $\beta(x_n) = x_{n+1}$, so that $\beta(x_n) \to b$
also. Then

$$x_n \times \beta(x_n) \to b \times b$$

in the product space. See Figure 32.3. Now we have a contradiction, for the point $b \times b$
lies in the set A, which is contained in the open set U; and U contains none of the points
$x_n \times \beta(x_n)$.


Figure 32.3


Exercises 


1. Show that a closed subspace of a normal space is normal. 


2. Show that if $\prod X_\alpha$ is Hausdorff, or regular, or normal, then so is $X_\alpha$. (Assume
that each $X_\alpha$ is nonempty.)


3. Show that every locally compact Hausdorff space is regular.


4. Show that every regular Lindelöf space is normal.


5. Is $\mathbb{R}^\omega$ normal in the product topology? In the uniform topology?

It is not known whether $\mathbb{R}^\omega$ is normal in the box topology. Mary-Ellen Rudin
has shown that the answer is affirmative if one assumes the continuum hypothesis [RM].
In fact, she shows it satisfies a stronger condition called paracompactness.


6. A space X is said to be completely normal if every subspace of X is normal.
Show that X is completely normal if and only if for every pair A, B of separated
sets in X (that is, sets such that $\bar{A} \cap B = \varnothing$ and $A \cap \bar{B} = \varnothing$), there exist
disjoint open sets containing them. [Hint: If X is completely normal, consider
$X - (\bar{A} \cap \bar{B})$.]

7. Which of the following spaces are completely normal? Justify your answers. 

(a) A subspace of a completely normal space. 

(b) The product of two completely normal spaces. 
(c) A well-ordered set in the order topology. 

(d) A metrizable space.

(e) A compact Hausdorff space.

(f) A regular space with a countable basis.

(g) The space $\mathbb{R}_\ell$.

8. Prove the following:

Theorem. Every linear continuum X is normal.

(a) Let C be a nonempty closed subset of X. If U is a component of $X - C$, show
that U is a set of the form $(c, c')$ or $(c, \infty)$ or $(-\infty, c)$, where $c, c' \in C$.

(b) Let A and B be closed disjoint subsets of X. For each component W of
$X - (A \cup B)$ that is an open interval with one end point in A and the other
in B, choose a point $c_W$ of W. Show that the set C of the points $c_W$ is closed.

(c) Show that if V is a component of $X - C$, then V does not intersect both A
and B.


9. Prove the following:

Theorem. If J is uncountable, then $\mathbb{R}^J$ is not normal.

Proof. (This proof is due to A. H. Stone, as adapted in [S-S].) Let $X = (\mathbb{Z}_+)^J$; it
will suffice to show that X is not normal, since X is a closed subspace of $\mathbb{R}^J$. We
use functional notation for the elements of X, so that the typical element of X is
a function $x : J \to \mathbb{Z}_+$.

(a) If $x \in X$ and if B is a finite subset of J, let $U(x, B)$ denote the set consisting
of all those elements y of X such that $y(\alpha) = x(\alpha)$ for $\alpha \in B$. Show the sets
$U(x, B)$ are a basis for X.

(b) Define $P_n$ to be the subset of X consisting of those x such that on the set
$J - x^{-1}(n)$, the map x is injective. Show that $P_1$ and $P_2$ are closed and
disjoint.

(c) Suppose U and V are open sets containing $P_1$ and $P_2$, respectively. Given a
sequence $\alpha_1, \alpha_2, \ldots$ of distinct elements of J, and a sequence

$$0 = n_0 < n_1 < n_2 < \cdots$$

of integers, for each $i \ge 1$ let us set

$$B_i = \{\alpha_1, \ldots, \alpha_{n_i}\}$$

and define $x_i \in X$ by the equations

$$x_i(\alpha_j) = j \quad\text{for } 1 \le j \le n_{i-1},$$
$$x_i(\alpha) = 1 \quad\text{for all other values of } \alpha.$$

Show that one can choose the sequences $\alpha_j$ and $n_j$ so that for each i, one
has the inclusion

$$U(x_i, B_i) \subset U.$$

[Hint: To begin, note that $x_1(\alpha) = 1$ for all $\alpha$; now choose $B_1$ so that
$U(x_1, B_1) \subset U$.]

(d) Let A be the set $\{\alpha_1, \alpha_2, \ldots\}$ constructed in (c). Define $y : J \to \mathbb{Z}_+$ by the
equations

$$y(\alpha_j) = j \quad\text{for } \alpha_j \in A,$$
$$y(\alpha) = 2 \quad\text{for all other values of } \alpha.$$

Choose B so that $U(y, B) \subset V$. Then choose i so that $B \cap A$ is contained
in the set $B_i$. Show that

$$U(x_{i+1}, B_{i+1}) \cap U(y, B)$$

is not empty.
10. Is every topological group normal? 


§33 The Urysohn Lemma 


Now we come to the first deep theorem of the book, a theorem that is commonly 
called the “Urysohn lemma.” It asserts the existence of certain real-valued continuous 
functions on a normal space X. It is the crucial tool used in proving a number of 
important theorems. We shall prove three of them—the Urysohn metrization theorem, 
the Tietze extension theorem, and an imbedding theorem for manifolds—in succeeding 
sections of this chapter. 

Why do we call the Urysohn lemma a “deep” theorem? Because its proof involves 
a really original idea, which the previous proofs did not. Perhaps we can explain 
what we mean this way: By and large, one would expect that if one went through this 
book and deleted all the proofs we have given up to now and then handed the book 
to a bright student who had not studied topology, that student ought to be able to go 
through the book and work out the proofs independently. (It would take a good deal of 
time and effort, of course; and one would not expect the student to handle the trickier 
examples.) But the Urysohn lemma is on a different level. It would take considerably 
more originality than most of us possess to prove this lemma unless we were given 
copious hints! 


Theorem 33.1 (Urysohn lemma). Let X be a normal space; let A and B be disjoint
closed subsets of X. Let [a, b] be a closed interval in the real line. Then there exists a
continuous map

$$f : X \to [a, b]$$

such that f(x) = a for every x in A, and f(x) = b for every x in B.


Proof. We need consider only the case where the interval in question is the interval
[0, 1]; the general case follows from that one. The first step of the proof is to construct,
using normality, a certain family $U_p$ of open sets of X, indexed by the rational
numbers. Then one uses these sets to define the continuous function f.

Step 1. Let P be the set of all rational numbers in the interval [0, 1].† We shall
define, for each p in P, an open set $U_p$ of X, in such a way that whenever p < q, we
have

$$\bar{U}_p \subset U_q.$$

Thus, the sets $U_p$ will be simply ordered by inclusion in the same way their subscripts
are ordered by the usual ordering in the real line.

Because P is countable, we can use induction to define the sets $U_p$ (or rather, the
principle of recursive definition). Arrange the elements of P in an infinite sequence in
some way; for convenience, let us suppose that the numbers 1 and 0 are the first two
elements of the sequence.

Now define the sets $U_p$, as follows. First, define $U_1 = X - B$. Second, because A
is a closed set contained in the open set $U_1$, we may by normality of X choose an open
set $U_0$ such that

$$A \subset U_0 \quad\text{and}\quad \bar{U}_0 \subset U_1.$$

In general, let $P_n$ denote the set consisting of the first n rational numbers in the
sequence. Suppose that $U_p$ is defined for all rational numbers p belonging to the
set $P_n$, satisfying the condition

$$(*) \qquad p < q \implies \bar{U}_p \subset U_q.$$

Let r denote the next rational number in the sequence; we wish to define $U_r$.

Consider the set $P_{n+1} = P_n \cup \{r\}$. It is a finite subset of the interval [0, 1], and, as
such, it has a simple ordering derived from the usual order relation < on the real line.
In a finite simply ordered set, every element (other than the smallest and the largest)
has an immediate predecessor and an immediate successor. (See Theorem 10.1.) The
number 0 is the smallest element, and 1 is the largest element, of the simply ordered
set $P_{n+1}$, and r is neither 0 nor 1. So r has an immediate predecessor p in $P_{n+1}$ and an
immediate successor q in $P_{n+1}$. The sets $U_p$ and $U_q$ are already defined, and $\bar{U}_p \subset U_q$
by the induction hypothesis. Using normality of X, we can find an open set $U_r$ of X
such that

$$\bar{U}_p \subset U_r \quad\text{and}\quad \bar{U}_r \subset U_q.$$

We assert that (*) now holds for every pair of elements of $P_{n+1}$. If both elements lie
in $P_n$, (*) holds by the induction hypothesis. If one of them is r and the other is a point
s of $P_n$, then either $s \le p$, in which case

$$\bar{U}_s \subset \bar{U}_p \subset U_r,$$

or $s \ge q$, in which case

$$\bar{U}_r \subset U_q \subset U_s.$$

Thus, for every pair of elements of $P_{n+1}$, relation (*) holds.
By induction, we have $U_p$ defined for all $p \in P$.

† Actually, any countable dense subset of [0, 1] will do, providing it contains the points 0 and 1.

To illustrate, let us suppose we started with the standard way of arranging the elements
of P in an infinite sequence:

$$P = \left\{1,\ 0,\ \tfrac{1}{2},\ \tfrac{1}{3},\ \tfrac{2}{3},\ \tfrac{1}{4},\ \tfrac{3}{4},\ \tfrac{1}{5},\ \tfrac{2}{5},\ \tfrac{3}{5},\ \ldots\right\}.$$

After defining $U_0$ and $U_1$, we would define $U_{1/2}$ so that $\bar{U}_0 \subset U_{1/2}$ and $\bar{U}_{1/2} \subset U_1$. Then
we would fit in $U_{1/3}$ between $U_0$ and $U_{1/2}$, and $U_{2/3}$ between $U_{1/2}$ and $U_1$. And so on. At
the eighth step of the proof we would have the situation pictured in Figure 33.1. And the
ninth step would consist of choosing an open set $U_{2/5}$ to fit in between $U_{1/3}$ and $U_{1/2}$. And
so on.
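
The bookkeeping of the recursive step can be sketched in a few lines of Python. This is only an illustration of how the immediate predecessor and successor of the next rational r are found among the rationals already handled; the function names `standard_sequence` and `neighbours` are hypothetical, and the enumeration by increasing denominator is assumed to be the "standard" arrangement meant above.

```python
from fractions import Fraction
from math import gcd

def standard_sequence(count):
    """First `count` terms of the sequence 1, 0, 1/2, 1/3, 2/3, 1/4, 3/4, 1/5, ..."""
    seq = [Fraction(1), Fraction(0)]
    denom = 2
    while len(seq) < count:
        # Append the reduced fractions with this denominator, in increasing order.
        for num in range(1, denom):
            if gcd(num, denom) == 1:
                seq.append(Fraction(num, denom))
        denom += 1
    return seq[:count]

def neighbours(seq, n):
    """Immediate predecessor p and successor q of r = seq[n] among seq[0..n] (needs n >= 2)."""
    r = seq[n]
    earlier = seq[:n]
    p = max(s for s in earlier if s < r)   # exists because 0 occurs earlier
    q = min(s for s in earlier if s > r)   # exists because 1 occurs earlier
    return p, q

seq = standard_sequence(9)
p, q = neighbours(seq, 8)    # the ninth element of the sequence
print(seq[8], p, q)
```

For the ninth element the sketch returns the pair between which the new open set must be fitted, in agreement with the description of the ninth step above.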


Figure 33.1 


Step 2. Now we have defined $U_p$ for all rational numbers p in the interval [0, 1].
We extend this definition to all rational numbers p in $\mathbb{R}$ by defining

$$U_p = \varnothing \quad\text{if } p < 0,$$
$$U_p = X \quad\text{if } p > 1.$$

It is still true (as you can check) that for any pair of rational numbers p and q,

$$p < q \implies \bar{U}_p \subset U_q.$$


Step 3. Given a point x of X, let us define $Q(x)$ to be the set of those rational
numbers p such that the corresponding open sets $U_p$ contain x:

$$Q(x) = \{p \mid x \in U_p\}.$$

This set contains no number less than 0, since no x is in $U_p$ for p < 0. And it contains
every number greater than 1, since every x is in $U_p$ for p > 1. Therefore, $Q(x)$ is
bounded below, and its greatest lower bound is a point of the interval [0, 1]. Define

$$f(x) = \inf Q(x) = \inf\{p \mid x \in U_p\}.$$
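
To see this definition at work, here is a minimal Python sketch on a hypothetical concrete case. It assumes X is the real line, A = (−∞, 0], B = [1, ∞), and $U_p = (-\infty, (1+p)/2)$ for 0 ≤ p ≤ 1. This family is chosen by hand for illustration (it satisfies $A \subset U_0$, $U_1 = X - B$, and $\bar{U}_p \subset U_q$ for p < q); it is not produced by the recursive construction of Step 1. The infimum is approximated over the rationals k/N, so the result is only claimed to be within 1/N of inf Q(x).

```python
from fractions import Fraction

def in_U(x, p):
    """Membership of x in U_p for the assumed family on the real line."""
    if p < 0:
        return False              # U_p is empty for p < 0
    if p > 1:
        return True               # U_p is all of X for p > 1
    return x < (1 + p) / 2        # U_p = (-inf, (1 + p)/2) for 0 <= p <= 1

def f_approx(x, N=1000):
    """Approximate inf Q(x) by the least grid rational k/N lying in Q(x)."""
    candidates = [Fraction(k, N) for k in range(0, N + 2)]   # includes one p > 1
    m = min(p for p in candidates if in_U(x, p))
    # Q(x) contains every rational greater than 1, so the infimum never exceeds 1.
    return min(m, Fraction(1))

values = [f_approx(Fraction(-3)), f_approx(Fraction(3, 4)), f_approx(Fraction(2))]
print(values)
```

In this example a direct computation gives f(x) = 2x − 1 for 1/2 ≤ x ≤ 1, with f = 0 to the left and f = 1 to the right, and the sketch reproduces these values up to the grid spacing.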


Step 4. We show that f is the desired function. If $x \in A$, then $x \in U_p$ for every
$p \ge 0$, so that $Q(x)$ equals the set of all nonnegative rationals, and $f(x) = \inf Q(x) = 0$.
Similarly, if $x \in B$, then $x \in U_p$ for no $p \le 1$, so that $Q(x)$ consists of all rational
numbers greater than 1, and $f(x) = 1$.

All this is easy. The only hard part is to show that f is continuous. For this
purpose, we first prove the following elementary facts:

(1) $x \in \bar{U}_r \implies f(x) \le r$.
(2) $x \notin U_r \implies f(x) \ge r$.

To prove (1), note that if $x \in \bar{U}_r$, then $x \in U_s$ for every s > r. Therefore, $Q(x)$
contains all rational numbers greater than r, so that by definition we have

$$f(x) = \inf Q(x) \le r.$$

To prove (2), note that if $x \notin U_r$, then x is not in $U_s$ for any s < r. Therefore, $Q(x)$
contains no rational numbers less than r, so that

$$f(x) = \inf Q(x) \ge r.$$

Now we prove continuity of f. Given a point $x_0$ of X and an open interval (c, d)
in $\mathbb{R}$ containing the point $f(x_0)$, we wish to find a neighborhood U of $x_0$ such that
$f(U) \subset (c, d)$. Choose rational numbers p and q such that

$$c < p < f(x_0) < q < d.$$

We assert that the open set

$$U = U_q - \bar{U}_p$$

is the desired neighborhood of $x_0$. See Figure 33.2.


Figure 33.2


First, we note that $x_0 \in U$. For the fact that $f(x_0) < q$ implies by condition (2)
that $x_0 \in U_q$, while the fact that $f(x_0) > p$ implies by (1) that $x_0 \notin \bar{U}_p$.

Second, we show that $f(U) \subset (c, d)$. Let $x \in U$. Then $x \in U_q \subset \bar{U}_q$, so
that $f(x) \le q$, by (1). And $x \notin \bar{U}_p$, so that $x \notin U_p$ and $f(x) \ge p$, by (2). Thus,
$f(x) \in [p, q] \subset (c, d)$, as desired. ∎


Definition. If A and B are two subsets of the topological space X, and if there is a
continuous function $f : X \to [0, 1]$ such that $f(A) = \{0\}$ and $f(B) = \{1\}$, we say
that A and B can be separated by a continuous function.


The Urysohn lemma says that if every pair of disjoint closed sets in X can be
separated by disjoint open sets, then each such pair can be separated by a continuous
function. The converse is trivial, for if $f : X \to [0, 1]$ is the function, then $f^{-1}([0, \tfrac{1}{2}))$
and $f^{-1}((\tfrac{1}{2}, 1])$ are disjoint open sets containing A and B, respectively.

This fact leads to a question that may already have occurred to you: Why cannot 
the proof of the Urysohn lemma be generalized to show that in a regular space, where 
you can separate points from closed sets by disjoint open sets, you can also separate 
points from closed sets by continuous functions? 

At first glance, it seems that the proof of the Urysohn lemma should go through. 
You take a point a and a closed set B not containing a, and you begin the proof
just as before by defining $U_1 = X - B$ and choosing $U_0$ to be an open set about a
whose closure is contained in $U_1$ (using regularity of X). But at the very next step
of the proof, you run into difficulty. Suppose that p is the next rational number in
the sequence after 0 and 1. You want to find an open set $U_p$ such that $\bar{U}_0 \subset U_p$ and
$\bar{U}_p \subset U_1$. For this, regularity is not enough.

Requiring that one be able to separate a point from a closed set by a continuous
function is, in fact, a stronger condition than requiring that one can separate them by
disjoint open sets. We make this requirement into a new separation axiom:


Definition. A space X is completely regular if one-point sets are closed in X and
if for each point $x_0$ and each closed set A not containing $x_0$, there is a continuous
function $f : X \to [0, 1]$ such that $f(x_0) = 1$ and $f(A) = \{0\}$.


A normal space is completely regular, by the Urysohn lemma, and a completely
regular space is regular, since given f, the sets $f^{-1}([0, \tfrac{1}{2}))$ and $f^{-1}((\tfrac{1}{2}, 1])$ are
disjoint open sets about A and $x_0$, respectively. As a result, this new axiom fits in between
regularity and normality in the list of separation axioms. Note that in the definition one
could just as well require the function to map $x_0$ to 0, and A to {1}, for $g(x) = 1 - f(x)$
satisfies this condition. But our definition is at times a bit more convenient.

In the early years of topology, the separation axioms, listed in order of increasing
strength, were labelled $T_1$, $T_2$ (Hausdorff), $T_3$ (regular), $T_4$ (normal), and $T_5$
(completely normal), respectively. The letter "T" comes from the German "Trennungsaxiom,"
which means "separation axiom." Later, when the notion of complete regularity
was introduced, someone suggested facetiously that it should be called the "$T_{3\frac{1}{2}}$
axiom," since it lies between regularity and normality. This terminology is in fact
sometimes used in the literature!

Unlike normality, this new separation axiom is nicely behaved with regard to sub- 
spaces and products: 


Theorem 33.2. A subspace of a completely regular space is completely regular. A
product of completely regular spaces is completely regular.

Proof. Let X be completely regular; let Y be a subspace of X. Let $x_0$ be a point of Y,
and let A be a closed set of Y disjoint from $x_0$. Now $A = \bar{A} \cap Y$, where $\bar{A}$ denotes the
closure of A in X. Therefore, $x_0 \notin \bar{A}$. Since X is completely regular, we can choose
a continuous function $f : X \to [0, 1]$ such that $f(x_0) = 1$ and $f(\bar{A}) = \{0\}$. The
restriction of f to Y is the desired continuous function on Y.

Let $X = \prod X_\alpha$ be a product of completely regular spaces. Let $b = (b_\alpha)$ be a point
of X and let A be a closed set of X disjoint from b. Choose a basis element $\prod U_\alpha$
containing b that does not intersect A; then $U_\alpha = X_\alpha$ except for finitely many $\alpha$, say
$\alpha = \alpha_1, \ldots, \alpha_n$. Given $i = 1, \ldots, n$, choose a continuous function

$$f_i : X_{\alpha_i} \to [0, 1]$$

such that $f_i(b_{\alpha_i}) = 1$ and $f_i(X_{\alpha_i} - U_{\alpha_i}) = \{0\}$. Let $\phi_i(x) = f_i(\pi_{\alpha_i}(x))$; then $\phi_i$ maps X
continuously into $\mathbb{R}$ and vanishes outside $\pi_{\alpha_i}^{-1}(U_{\alpha_i})$. The product

$$f(x) = \phi_1(x) \cdot \phi_2(x) \cdots \phi_n(x)$$

is the desired continuous function on X, for it equals 1 at b and vanishes outside $\prod U_\alpha$. ∎
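
The product construction is easy to visualize in the plane. The following minimal Python sketch assumes X = R × R, a point b, and a basis element that is an open box centered at b; the piecewise-linear `bump` functions are one hypothetical choice of the maps $f_i$, and all names are illustrative.

```python
def bump(center, radius):
    """Continuous map R -> [0, 1]: equals 1 at center, 0 outside (center - radius, center + radius)."""
    return lambda t: max(0.0, 1.0 - abs(t - center) / radius)

def product_bump(b, radii):
    """x -> phi_1(x) * ... * phi_n(x), where phi_i(x) = f_i(x_i) depends on one coordinate."""
    fs = [bump(c, r) for c, r in zip(b, radii)]
    def f(x):
        value = 1.0
        for f_i, x_i in zip(fs, x):
            value *= f_i(x_i)      # a single vanishing factor makes the product vanish
        return value
    return f

# Point b = (2, -1) and the box (1.5, 2.5) x (-1.25, -0.75) about it.
f = product_bump(b=(2.0, -1.0), radii=(0.5, 0.25))
print(f((2.0, -1.0)), f((2.6, -1.0)), f((2.0, -0.7)))
```

The product vanishes as soon as one coordinate leaves its interval, which is why it vanishes outside the whole box and hence on any closed set disjoint from the box.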


EXAMPLE 1. The spaces $\mathbb{R}_\ell^2$ and $S_\Omega \times \bar{S}_\Omega$ are completely regular but not normal. For
they are products of spaces that are completely regular (in fact, normal).

A space that is regular but not completely regular is much harder to find. Most of
the examples that have been constructed for this purpose are difficult, and require
considerable familiarity with cardinal numbers. Fairly recently, however, John Thomas [T] has
constructed a much more elementary example, which we outline in Exercise 11.


Exercises 


1. Examine the proof of the Urysohn lemma, and show that for given r,

$$f^{-1}(r) = \bigcap_{p > r} U_p - \bigcup_{q < r} U_q,$$

p, q rational.

2. (a) Show that a connected normal space having more than one point is uncountable.

(b) Show that a connected regular space having more than one point is uncountable.†
[Hint: Any countable space is Lindelöf.]

3. Give a direct proof of the Urysohn lemma for a metric space (X, d) by setting

$$f(x) = \frac{d(x, A)}{d(x, A) + d(x, B)}.$$

† Surprisingly enough, there does exist a connected Hausdorff space that is countably infinite. See
Example 75 of [S-S].

4. Recall that A is a "$G_\delta$ set" in X if A is the intersection of a countable collection
of open sets of X.

Theorem. Let X be normal. There exists a continuous function $f : X \to [0, 1]$
such that $f(x) = 0$ for $x \in A$, and $f(x) > 0$ for $x \notin A$, if and only if A is a
closed $G_\delta$ set in X.

A function satisfying the requirements of this theorem is said to vanish precisely
on A.


5. Prove:

Theorem (Strong form of the Urysohn lemma). Let X be a normal space. There
is a continuous function $f : X \to [0, 1]$ such that $f(x) = 0$ for $x \in A$, and
$f(x) = 1$ for $x \in B$, and $0 < f(x) < 1$ otherwise, if and only if A and B are
disjoint closed $G_\delta$ sets in X.


6. A space X is said to be perfectly normal if X is normal and if every closed set
in X is a $G_\delta$ set in X.

(a) Show that every metrizable space is perfectly normal.

(b) Show that a perfectly normal space is completely normal. For this reason the
condition of perfect normality is sometimes called the "$T_6$ axiom." [Hint:
Let A and B be separated sets in X. Choose continuous functions $f, g :
X \to [0, 1]$ that vanish precisely on $\bar{A}$ and $\bar{B}$, respectively. Consider the
function $f - g$.]

(c) There is a familiar space that is completely normal but not perfectly normal.
What is it?


7. Show that every locally compact Hausdorff space is completely regular.


8. Let X be completely regular; let A and B be disjoint closed subsets of X. Show
that if A is compact, there is a continuous function $f : X \to [0, 1]$ such that
$f(A) = \{0\}$ and $f(B) = \{1\}$.


9. Show that $\mathbb{R}^J$ in the box topology is completely regular. [Hint: Show that it
suffices to consider the case where the box neighborhood $(-1, 1)^J$ is disjoint
from A and the point is the origin. Then use the fact that a function continuous
in the uniform topology is also continuous in the box topology.]


10. Prove the following:

Theorem. Every topological group is completely regular.

Proof. Let $V_0$ be a neighborhood of the identity element e, in the topological
group G. In general, choose $V_n$ to be a neighborhood of e such that $V_n \cdot V_n \subset
V_{n-1}$. Consider the set of all dyadic rationals p, that is, all rational numbers of
the form $k/2^n$, with k and n integers. For each dyadic rational p in (0, 1], define
an open set U(p) inductively as follows: $U(1) = V_0$ and $U(\tfrac{1}{2}) = V_1$. Given n,
if $U(k/2^n)$ is defined for $0 < k/2^n \le 1$, define

$$U(1/2^{n+1}) = V_{n+1},$$
$$U((2k + 1)/2^{n+1}) = V_{n+1} \cdot U(k/2^n)$$

for $0 < k < 2^n$. For $p \le 0$, let $U(p) = \varnothing$, and for p > 1, let U(p) = G. Show
that

$$V_n \cdot U(k/2^n) \subset U((k + 1)/2^n)$$

for all k and n. Proceed as in the Urysohn lemma.

This exercise is adapted from [M-Z], to which the reader is referred for further
results on topological groups.

11. Define a set X as follows: For each even integer m, let $L_m$ denote the line segment
$m \times [-1, 0]$ in the plane. For each odd integer n and each integer $k \ge 2$,
let $C_{n,k}$ denote the union of the line segments $(n + 1 - 1/k) \times [-1, 0]$ and
$(n - 1 + 1/k) \times [-1, 0]$ and the semicircle

$$\{x \times y \mid (x - n)^2 + y^2 = (1 - 1/k)^2 \text{ and } y \ge 0\}$$

in the plane. Let $p_{n,k}$ denote the topmost point $n \times (1 - 1/k)$ of this semicircle.
Let X be the union of all the sets $L_m$ and $C_{n,k}$, along with two extra points a
and b. Topologize X by taking sets of the following four types as basis elements:

(i) The intersection of X with a horizontal open line segment that contains
none of the points $p_{n,k}$.
(ii) A set formed from one of the sets $C_{n,k}$ by deleting finitely many points.
(iii) For each even integer m, the union of {a} and the set of points $x \times y$ of
X for which x < m.
(iv) For each even integer m, the union of {b} and the set of points $x \times y$ of
X for which x > m.

(a) Sketch X; show that these sets form a basis for a topology on X.

(b) Let f be a continuous real-valued function on X. Show that for any c, the
set $f^{-1}(c)$ is a $G_\delta$ set in X. (This is true for any space X.) Conclude that
the set $S_{n,k}$ consisting of those points p of $C_{n,k}$ for which $f(p) \ne f(p_{n,k})$
is countable. Choose $d \in [-1, 0]$ so that the line y = d intersects none of
the sets $S_{n,k}$. Show that for n odd,

$$f((n - 1) \times d) = \lim_{k \to \infty} f(p_{n,k}) = f((n + 1) \times d).$$


Conclude that f(a) = f(b).
(c) Show that X is regular but not completely regular.


§34 The Urysohn Metrization Theorem


Now we come to the major goal of this chapter, a theorem that gives us conditions
under which a topological space is metrizable. The proof weaves together a number
of strands from previous parts of the book; it uses results on metric topologies from
Chapter 2 as well as facts concerning the countability and separation axioms proved in
the present chapter. The basic construction used in the proof is a simple one, but very
useful. You will see it several times more in this book, in various guises.

There are two versions of the proof, and since each has useful generalizations that
will appear subsequently, we present both of them here. The first version generalizes
to give an imbedding theorem for completely regular spaces. The second version will
be generalized in Chapter 6 when we prove the Nagata-Smirnov metrization theorem.


Theorem 34.1 (Urysohn metrization theorem). Every regular space X with a 
countable basis is metrizable. 


Proof. We shall prove that X is metrizable by imbedding X in a metrizable space Y,
that is, by showing X homeomorphic with a subspace of Y. The two versions of
the proof differ in the choice of the metrizable space Y. In the first version, Y is
the space $\mathbb{R}^\omega$ in the product topology, a space that we have previously proved to be
metrizable (Theorem 20.5). In the second version, the space Y is also $\mathbb{R}^\omega$, but this
time in the topology given by the uniform metric $\bar{\rho}$ (see §20). In each case, it turns out
that our construction actually imbeds X in the subspace $[0, 1]^\omega$ of $\mathbb{R}^\omega$.

Step 1. We prove the following: There exists a countable collection of continuous
functions $f_n : X \to [0, 1]$ having the property that given any point $x_0$ of X and
any neighborhood U of $x_0$, there exists an index n such that $f_n$ is positive at $x_0$ and
vanishes outside U.

It is a consequence of the Urysohn lemma that, given $x_0$ and U, there exists such a
function. However, if we choose one such function for each pair $(x_0, U)$, the resulting
collection will not in general be countable. Our task is to cut the collection down to
size. Here is one way to proceed:

Let $\{B_n\}$ be a countable basis for X. For each pair n, m of indices for which
$\bar{B}_n \subset B_m$, apply the Urysohn lemma to choose a continuous function $g_{n,m} : X \to
[0, 1]$ such that $g_{n,m}(\bar{B}_n) = \{1\}$ and $g_{n,m}(X - B_m) = \{0\}$. Then the collection $\{g_{n,m}\}$
satisfies our requirement: Given $x_0$ and given a neighborhood U of $x_0$, one can choose
a basis element $B_m$ containing $x_0$ that is contained in U. Using regularity, one can then
choose $B_n$ so that $x_0 \in B_n$ and $\bar{B}_n \subset B_m$. Then n, m is a pair of indices for which the
function $g_{n,m}$ is defined; and it is positive at $x_0$ and vanishes outside U. Because the
collection $\{g_{n,m}\}$ is indexed with a subset of $\mathbb{Z}_+ \times \mathbb{Z}_+$, it is countable; therefore it can
be reindexed with the positive integers, giving us the desired collection $\{f_n\}$.
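
The reindexing at the end of Step 1 can be made explicit. The sketch below uses the Cantor pairing function, which is one standard bijection between pairs of positive integers and positive integers; the proof itself only needs that some such enumeration exists. The set `valid` is a hypothetical stand-in for the pairs (n, m) for which $g_{n,m}$ is defined.

```python
def pair_index(n, m):
    """Cantor pairing: a bijection from pairs of positive integers to positive integers."""
    s = n + m - 2                    # 0-based number of the anti-diagonal containing (n, m)
    return s * (s + 1) // 2 + n      # position of (n, m) when diagonals are listed in turn

def unpair(k):
    """Inverse of pair_index."""
    s = 0
    while (s + 1) * (s + 2) // 2 < k:    # find the diagonal that contains index k
        s += 1
    n = k - s * (s + 1) // 2
    return n, s + 2 - n

# Hypothetical set of index pairs for which g_{n,m} is defined.
valid = {(1, 2), (3, 2), (2, 5)}
# Listing the valid pairs in order of their Cantor index reindexes them as f_1, f_2, ...
reindexed = sorted(valid, key=lambda nm: pair_index(*nm))
print(reindexed)
```

Any subset of the pairs inherits an enumeration in this way, which is all that is required to pass from the doubly indexed collection to a sequence of functions.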


Step 2 (First version of the proof). Given the functions $f_n$ of Step 1, take $\mathbb{R}^\omega$ in the
product topology and define a map $F : X \to \mathbb{R}^\omega$ by the rule

$$F(x) = (f_1(x), f_2(x), \ldots).$$

We assert that F is an imbedding.

First, F is continuous because $\mathbb{R}^\omega$ has the product topology and each $f_n$ is continuous.
Second, F is injective because given $x \ne y$, we know there is an index n such
that $f_n(x) > 0$ and $f_n(y) = 0$; therefore, $F(x) \ne F(y)$.

Finally, we must prove that F is a homeomorphism of X onto its image, the subspace
$Z = F(X)$ of $\mathbb{R}^\omega$. We know that F defines a continuous bijection of X with Z,
so we need only show that for each open set U in X, the set F(U) is open in Z. Let $z_0$
be a point of F(U). We shall find an open set W of Z such that

$$z_0 \in W \subset F(U).$$

Let $x_0$ be the point of U such that $F(x_0) = z_0$. Choose an index N for which
$f_N(x_0) > 0$ and $f_N(X - U) = \{0\}$. Take the open ray $(0, +\infty)$ in $\mathbb{R}$, and let V be the
open set

$$V = \pi_N^{-1}((0, +\infty))$$

of $\mathbb{R}^\omega$. Let $W = V \cap Z$; then W is open in Z, by definition of the subspace topology.
See Figure 34.1. We assert that $z_0 \in W \subset F(U)$. First, $z_0 \in W$ because

$$\pi_N(z_0) = \pi_N(F(x_0)) = f_N(x_0) > 0.$$

Second, $W \subset F(U)$. For if $z \in W$, then $z = F(x)$ for some $x \in X$, and $\pi_N(z) \in
(0, +\infty)$. Since $\pi_N(z) = \pi_N(F(x)) = f_N(x)$, and $f_N$ vanishes outside U, the point x
must be in U. Then $z = F(x)$ is in F(U), as desired.


Figure 34.1


Step 3 (Second version of the proof). In this version, we imbed X in the metric
space $(\mathbb{R}^\omega, \bar{\rho})$. Actually, we imbed X in the subspace $[0, 1]^\omega$, on which $\bar{\rho}$ equals the
metric

$$\rho(x, y) = \sup\{|x_i - y_i|\}.$$

We use the countable collection of functions $f_n : X \to [0, 1]$ constructed in Step 1.
But now we impose the additional condition that $f_n(x) \le 1/n$ for all x. (This condition
is easy to satisfy; we can just divide each function $f_n$ by n.)

Define $F : X \to [0, 1]^\omega$ by the equation

$$F(x) = (f_1(x), f_2(x), \ldots)$$

as before. We assert that F is now an imbedding relative to the metric $\rho$ on $[0, 1]^\omega$. We
know from Step 2 that F is injective. Furthermore, we know that if we use the product
topology on $[0, 1]^\omega$, the map F carries open sets of X onto open sets of the subspace
$Z = F(X)$. This statement remains true if one passes to the finer (larger) topology on
$[0, 1]^\omega$ induced by the metric $\rho$.

The one thing left to do is to prove that F is continuous. This does not follow from
the fact that each component function is continuous, for we are not using the product
topology on $\mathbb{R}^\omega$ now. Here is where the assumption $f_n(x) \le 1/n$ comes in.

Let $x_0$ be a point of X, and let $\epsilon > 0$. To prove continuity, we need to find a
neighborhood U of $x_0$ such that

$$x \in U \implies \rho(F(x), F(x_0)) < \epsilon.$$

First choose N large enough that $1/N \le \epsilon/2$. Then for each $n = 1, \ldots, N$ use the
continuity of $f_n$ to choose a neighborhood $U_n$ of $x_0$ such that

$$|f_n(x) - f_n(x_0)| \le \epsilon/2$$

for $x \in U_n$. Let $U = U_1 \cap \cdots \cap U_N$; we show that U is the desired neighborhood
of $x_0$. Let $x \in U$. If $n \le N$,

$$|f_n(x) - f_n(x_0)| \le \epsilon/2$$

by choice of U. And if n > N, then

$$|f_n(x) - f_n(x_0)| < 1/N \le \epsilon/2$$

because $f_n$ maps X into [0, 1/n]. Therefore for all $x \in U$,

$$\rho(F(x), F(x_0)) \le \epsilon/2 < \epsilon,$$

as desired. ∎
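
The two cases of this estimate (finitely many coordinates handled by continuity, the tail handled by the bound 1/n) can be sketched numerically. The code below is only an illustration under stated assumptions: X is the real line, and $f_n(x) = |\sin(nx)|/n$ is a hypothetical family with values in [0, 1/n], not the family built in Step 1. The function `rho_upper_bound` returns an upper bound for the supremum, not its exact value.

```python
import math

def f(n, x):
    """Hypothetical f_n : R -> [0, 1/n], a [0, 1]-valued continuous map divided by n."""
    return abs(math.sin(n * x)) / n

def rho_upper_bound(x, x0, eps):
    """Upper bound for sup_n |f_n(x) - f_n(x0)|, split into head and tail as in the proof."""
    N = math.ceil(2 / eps)                      # ensures 1/N <= eps/2
    # Head: the finitely many coordinates n = 1, ..., N are compared directly.
    head = max(abs(f(n, x) - f(n, x0)) for n in range(1, N + 1))
    # Tail: for n > N both values lie in [0, 1/n], so they differ by less than 1/N.
    tail = 1 / N
    return max(head, tail)

bound = rho_upper_bound(1.001, 1.0, 0.25)
print(bound)
```

Without the condition $f_n(x) \le 1/n$ no such uniform tail bound would be available, and the coordinates beyond N could not be controlled all at once.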


In Step 2 of the preceding proof, we actually proved something stronger than the 
result stated there. For later use, we state it here. 


Theorem 34.2 (Imbedding theorem). Let X be a space in which one-point sets are
closed. Suppose that $\{f_\alpha\}_{\alpha \in J}$ is an indexed family of continuous functions $f_\alpha : X \to \mathbb{R}$
satisfying the requirement that for each point $x_0$ of X and each neighborhood U
of $x_0$, there is an index $\alpha$ such that $f_\alpha$ is positive at $x_0$ and vanishes outside U. Then
the function $F : X \to \mathbb{R}^J$ defined by


$$F(x) = (f_\alpha(x))_{\alpha \in J}$$

is an imbedding of X in $\mathbb{R}^J$. If $f_\alpha$ maps X into $[0,1]$ for each $\alpha$, then F imbeds X in
$[0,1]^J$.

The proof is almost a copy of Step 2 of the preceding proof; one merely replaces n
by $\alpha$, and $\mathbb{R}^\omega$ by $\mathbb{R}^J$, throughout. One needs one-point sets in X to be closed in order
to be sure that, given $x \ne y$, there is an index $\alpha$ such that $f_\alpha(x) \ne f_\alpha(y)$.

A family of continuous functions that satisfies the hypotheses of this theorem is 
said to separate points from closed sets in X. The existence of such a family is readily
seen to be equivalent, for a space X in which one-point sets are closed, to the re- 
quirement that X be completely regular. Therefore one has the following immediate 
corollary: 


Theorem 34.3. A space X is completely regular if and only if it is homeomorphic to
a subspace of $[0,1]^J$ for some J.


Exercises 


1. Give an example showing that a Hausdorff space with a countable basis need not 
be metrizable. 

2. Give an example showing that a space can be completely normal, and satisfy 
the first countability axiom, the Lindelöf condition, and have a countable dense
subset, and still not be metrizable. 


3. Let X be a compact Hausdorff space. Show that X is metrizable if and only if X 
has a countable basis. 


4. Let X be a locally compact Hausdorff space. Is it true that if X has a countable 
basis, then X is metrizable? Is it true that if X is metrizable, then X has a 
countable basis? 


5. Let X be a locally compact Hausdorff space. Let Y be the one-point compactifi- 
cation of X. Is it true that if X has a countable basis, then Y is metrizable? Is it 
true that if Y is metrizable, then X has a countable basis? 


6. Check the details of the proof of Theorem 34.2. 


7. A space X is locally metrizable if each point x of X has a neighborhood that is
metrizable in the subspace topology. Show that a compact Hausdorff space X is
metrizable if it is locally metrizable. [Hint: Show that X is a finite union of open
subspaces, each of which has a countable basis.]


8. Show that a regular Lindelöf space is metrizable if it is locally metrizable. [Hint:
A closed subspace of a Lindelöf space is Lindelöf.] Regularity is essential; where
do you use it in the proof?

9. Let X be a compact Hausdorff space that is the union of the closed subspaces $X_1$
and $X_2$. If $X_1$ and $X_2$ are metrizable, show that X is metrizable. [Hint: Construct
a countable collection $\mathcal{A}$ of open sets of X whose intersections with $X_i$ form a
basis for $X_i$, for i = 1, 2. Assume $X_1 - X_2$ and $X_2 - X_1$ belong to $\mathcal{A}$. Let $\mathcal{B}$
consist of finite intersections of elements of $\mathcal{A}$.]




*§35 The Tietze Extension Theorem†


One immediate consequence of the Urysohn lemma is the useful theorem called the 
Tietze extension theorem. It deals with the problem of extending a continuous real- 
valued function that is defined on a subspace of a space X to a continuous function 
defined on all of X. This theorem is important in many of the applications of topology. 


Theorem 35.1 (Tietze extension theorem). Let X be a normal space; let A be a 
closed subspace of X. 

(a) Any continuous map of A into the closed interval [a, b] of R may be extended 
to a continuous map of all of X into [a, b]. 

(b) Any continuous map of A into R may be extended to a continuous map of all
of X into R.


Proof. The idea of the proof is to construct a sequence of continuous functions $s_n$
defined on the entire space X, such that the sequence $s_n$ converges uniformly, and such
that the restriction of $s_n$ to A approximates f more and more closely as n becomes
large. Then the limit function will be continuous, and its restriction to A will equal f.


Step 1. The first step is to construct a particular function g defined on all of X such
that g is not too large, and such that g approximates f on the set A to a fair degree of
accuracy. To be more precise, let us take the case $f : A \to [-r, r]$. We assert that
there exists a continuous function $g : X \to \mathbb{R}$ such that

$$|g(x)| \le \tfrac{1}{3} r \quad \text{for all } x \in X,$$
$$|g(a) - f(a)| \le \tfrac{2}{3} r \quad \text{for all } a \in A.$$


The function g is constructed as follows:
Divide the interval $[-r, r]$ into three equal intervals of length $\tfrac{2}{3} r$:


$$I_1 = [-r, -\tfrac{1}{3} r], \qquad I_2 = [-\tfrac{1}{3} r, \tfrac{1}{3} r], \qquad I_3 = [\tfrac{1}{3} r, r].$$


Let B and C be the subsets


$$B = f^{-1}(I_1) \quad \text{and} \quad C = f^{-1}(I_3)$$


of A. Because f is continuous, B and C are closed disjoint subsets of A. Therefore,
they are closed in X. By the Urysohn lemma, there exists a continuous function


$$g : X \to [-\tfrac{1}{3} r, \tfrac{1}{3} r]$$


having the property that $g(x) = -\tfrac{1}{3} r$ for each x in B, and $g(x) = \tfrac{1}{3} r$ for each x in C.
Then $|g(x)| \le \tfrac{1}{3} r$ for all x. We assert that for each a in A,


$$|g(a) - f(a)| \le \tfrac{2}{3} r.$$


†This section will be assumed in §62. It is also used in a number of exercises.


Figure 35.1 


There are three cases. If $a \in B$, then both f(a) and g(a) belong to $I_1$. If $a \in C$, then
f(a) and g(a) are in $I_3$. And if $a \notin B \cup C$, then f(a) and g(a) are in $I_2$. In each case,
$|g(a) - f(a)| \le \tfrac{2}{3} r$. See Figure 35.1.


Step 2. We now prove part (a) of the Tietze theorem. Without loss of generality,
we can replace the arbitrary closed interval [a, b] of R by the interval [−1, 1].


Let $f : A \to [-1, 1]$ be a continuous map. Then f satisfies the hypotheses
of Step 1, with r = 1. Therefore, there exists a continuous real-valued function $g_1$,
defined on all of X, such that


$$|g_1(x)| \le 1/3 \quad \text{for } x \in X,$$
$$|f(a) - g_1(a)| \le 2/3 \quad \text{for } a \in A.$$


Now consider the function $f - g_1$. This function maps A into the interval [−2/3, 2/3],
so we can apply Step 1 again, letting r = 2/3. We obtain a real-valued function $g_2$
defined on all of X such that


$$|g_2(x)| \le \tfrac{1}{3}\left(\tfrac{2}{3}\right) \quad \text{for } x \in X,$$
$$|f(a) - g_1(a) - g_2(a)| \le \left(\tfrac{2}{3}\right)^2 \quad \text{for } a \in A.$$


Then we apply Step 1 to the function $f - g_1 - g_2$. And so on.


At the general step, we have real-valued functions $g_1, \ldots, g_n$ defined on all of X
such that

$$|f(a) - g_1(a) - \cdots - g_n(a)| \le \left(\tfrac{2}{3}\right)^n$$

for $a \in A$. Applying Step 1 to the function $f - g_1 - \cdots - g_n$, with $r = \left(\tfrac{2}{3}\right)^n$, we
obtain a real-valued function $g_{n+1}$ defined on all of X such that


$$|g_{n+1}(x)| \le \tfrac{1}{3}\left(\tfrac{2}{3}\right)^n \quad \text{for } x \in X,$$
$$|f(a) - g_1(a) - \cdots - g_{n+1}(a)| \le \left(\tfrac{2}{3}\right)^{n+1} \quad \text{for } a \in A.$$


By induction, the functions $g_n$ are defined for all n.
We now define


$$g(x) = \sum_{n=1}^{\infty} g_n(x)$$


for all x in X. Of course, we have to know that this infinite series converges. But that
follows from the comparison theorem of calculus; it converges by comparison with the
geometric series

$$\frac{1}{3} \sum_{n=1}^{\infty} \left(\frac{2}{3}\right)^{n-1}.$$


To show that g is continuous, we must show that the sequence $s_n$ converges to g
uniformly. This fact follows at once from the “Weierstrass M-test” of analysis. Without
assuming this result, one can simply note that if k > n, then


$$|s_k(x) - s_n(x)| = \left|\sum_{i=n+1}^{k} g_i(x)\right| \le \frac{1}{3} \sum_{i=n+1}^{k} \left(\frac{2}{3}\right)^{i-1} < \frac{1}{3} \sum_{i=n+1}^{\infty} \left(\frac{2}{3}\right)^{i-1} = \left(\frac{2}{3}\right)^n.$$


Holding n fixed and letting $k \to \infty$, we see that


$$|g(x) - s_n(x)| \le \left(\tfrac{2}{3}\right)^n$$


for all $x \in X$. Therefore, $s_n$ converges to g uniformly.

We show that g(a) = f(a) for $a \in A$. Let $s_n(x) = \sum_{i=1}^{n} g_i(x)$, the nth partial
sum of the series. Then g(x) is by definition the limit of the infinite sequence $s_n(x)$ of
partial sums. Since


$$\left|f(a) - \sum_{i=1}^{n} g_i(a)\right| = |f(a) - s_n(a)| \le \left(\tfrac{2}{3}\right)^n$$


for all a in A, it follows that $s_n(a) \to f(a)$ for all $a \in A$. Therefore, we have
f(a) = g(a) for $a \in A$.

Finally, we show that g maps X into the interval [−1, 1]. This condition is in fact
satisfied automatically, since the series $\frac{1}{3}\sum_{n=1}^{\infty}(2/3)^{n-1}$ converges to 1. However, this
is just a lucky accident rather than an essential part of the proof. If all we knew was
that g mapped X into R, then the map $r \circ g$, where $r : \mathbb{R} \to [-1, 1]$ is the map


$$r(y) = y \quad \text{if } |y| \le 1,$$
$$r(y) = y/|y| \quad \text{if } |y| \ge 1,$$


would be an extension of f mapping X into [−1, 1].
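
Steps 1 and 2 can be traced on a small computable example. The sketch below makes several assumptions that are not part of the proof: X is the real line, A is a finite set of points (closed in R), and the Urysohn function is replaced by the explicit metric-space formula d(x, B) / (d(x, B) + d(x, C)). The names `urysohn_step` and `tietze_partial_sums` are illustrative only.

```python
def urysohn_step(points, values, r):
    """Step 1 on X = R: given f on the finite closed set A (points -> values) with
    |f| <= r, return a continuous g : R -> [-r/3, r/3] with |g - f| <= 2r/3 on A."""
    B = [a for a, v in zip(points, values) if v <= -r / 3]  # f^{-1}(I_1)
    C = [a for a, v in zip(points, values) if v >= r / 3]   # f^{-1}(I_3)

    def dist(x, S):
        return min(abs(x - s) for s in S)

    def g(x):
        if not B and not C:
            return 0.0           # any constant in [-r/3, r/3] works here
        if not B:
            return r / 3
        if not C:
            return -r / 3
        dB, dC = dist(x, B), dist(x, C)
        u = dB / (dB + dC)       # equals 0 on B and 1 on C; B and C are disjoint
        return -r / 3 + (2 * r / 3) * u

    return g

def tietze_partial_sums(points, values, n_steps):
    """Step 2: apply Step 1 repeatedly to the residual f - g_1 - ... - g_n."""
    gs, residual, r = [], list(values), 1.0
    for _ in range(n_steps):
        g = urysohn_step(points, residual, r)
        gs.append(g)
        residual = [v - g(a) for a, v in zip(points, residual)]
        r *= 2 / 3
    return lambda x: sum(g(x) for g in gs)   # the partial sum s_n

A_points = [0.0, 1.0, 2.0, 3.5]
f_values = [1.0, -1.0, 0.25, -0.5]           # f maps A into [-1, 1]
n = 30
s = tietze_partial_sums(A_points, f_values, n)
errors = [abs(s(a) - v) for a, v in zip(A_points, f_values)]
print(max(errors) <= (2 / 3) ** n + 1e-12)
```

On A the partial sum differs from f by at most (2/3)^n, and everywhere on the line its absolute value stays below 1, matching the two estimates used in the proof.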

Step 3. We now prove part (b) of the theorem, in which f maps A into R. We can
replace R by the open interval (−1, 1), since this interval is homeomorphic to R.

So let f be a continuous map from A into (−1, 1). The half of the Tietze theorem
already proved shows that we can extend f to a continuous map $g : X \to [-1, 1]$
mapping X into the closed interval. How can we find a map h carrying X into the
open interval?

Given g, let us define a subset D of X by the equation


$$D = g^{-1}(\{-1\}) \cup g^{-1}(\{1\}).$$


Since g is continuous, D is a closed subset of X. Because g(A) = f(A), which is
contained in (−1, 1), the set A is disjoint from D. By the Urysohn lemma, there is a
continuous function $\phi : X \to [0, 1]$ such that $\phi(D) = \{0\}$ and $\phi(A) = \{1\}$. Define


$$h(x) = \phi(x) g(x).$$


Then h is continuous, being the product of two continuous functions. Also, h is an
extension of f, since for a in A,


$$h(a) = \phi(a) g(a) = 1 \cdot g(a) = f(a).$$


Finally, h maps all of X into the open interval (−1, 1). For if $x \in D$, then $h(x) =
0 \cdot g(x) = 0$. And if $x \notin D$, then $|g(x)| < 1$; it follows that $|h(x)| \le 1 \cdot |g(x)| < 1$. ∎

Exercises


1. Show that the Tietze extension theorem implies the Urysohn lemma. 


2. In the proof of the Tietze theorem, how essential was the clever decision in Step 1
to divide the interval [−r, r] into three equal pieces? Suppose instead that one
divides this interval into the three intervals


$$I_1 = [-r, -ar], \qquad I_2 = [-ar, ar], \qquad I_3 = [ar, r],$$


for some a with 0 < a < 1. For what values of a other than a = 1/3 (if any) 
does the proof go through? 

3. Let X be metrizable. Show that the following are equivalent:

(i) X is bounded under every metric that gives the topology of X.

(ii) Every continuous function $\phi : X \to \mathbb{R}$ is bounded.
(iii) X is limit point compact.
[Hint: If $\phi : X \to \mathbb{R}$ is a continuous function, then $F(x) = x \times \phi(x)$ is an
imbedding of X in $X \times \mathbb{R}$. If A is an infinite subset of X having no limit point,
let $\phi$ be a surjection of A onto $\mathbb{Z}_+$.]

4. Let Z be a topological space. If Y is a subspace of Z, we say that Y is a retract
of Z if there is a continuous map $r : Z \to Y$ such that r(y) = y for each $y \in Y$.
(a) Show that if Z is Hausdorff and Y is a retract of Z, then Y is closed in Z.
(b) Let A be a two-point set in $\mathbb{R}^2$. Show that A is not a retract of $\mathbb{R}^2$.

(c) Let $S^1$ be the unit circle in $\mathbb{R}^2$; show that $S^1$ is a retract of $\mathbb{R}^2 - \{\mathbf{0}\}$, where $\mathbf{0}$
is the origin. Can you conjecture whether or not $S^1$ is a retract of $\mathbb{R}^2$?

5. A space Y is said to have the universal extension property if for each triple
consisting of a normal space X, a closed subset A of X, and a continuous function
$f : A \to Y$, there exists an extension of f to a continuous map of X into Y.

(a) Show that $\mathbb{R}^J$ has the universal extension property.
(b) Show that if Y is homeomorphic to a retract of $\mathbb{R}^J$, then Y has the universal
extension property.

6. Let Y be a normal space. Then Y is said to be an absolute retract if for every
pair of spaces $(Y_0, Z)$ such that Z is normal and $Y_0$ is a closed subspace of Z
homeomorphic to Y, the space $Y_0$ is a retract of Z.

(a) Show that if Y has the universal extension property, then Y is an absolute
retract.
(b) Show that if Y is an absolute retract and Y is compact, then Y has the universal
extension property. [Hint: Assume the Tychonoff theorem, so you know
$[0, 1]^J$ is normal. Imbed Y in $[0, 1]^J$.]
7. (a) Show the logarithmic spiral


$$C = \{0 \times 0\} \cup \{e^t \cos t \times e^t \sin t \mid t \in \mathbb{R}\}$$


is a retract of $\mathbb{R}^2$. Can you define a specific retraction $r : \mathbb{R}^2 \to C$?
(b) Show that the “knotted x-axis” K of Figure 35.2 is a retract of $\mathbb{R}^3$.

Figure 35.2
*8. Prove the following:

Theorem. Let Y be a normal space. Then Y is an absolute retract if and only
if Y has the universal extension property.

[Hint: If X and Y are disjoint normal spaces, A is closed in X, and $f : A \to Y$
is a continuous map, define the adjunction space $Z_f$ to be the quotient space obtained
from $X \cup Y$ by identifying each point a of A with the point f(a) and with
all the points of $f^{-1}(\{f(a)\})$. Using the Tietze theorem, show that $Z_f$ is normal.
If $p : X \cup Y \to Z_f$ is the quotient map, show that $p|Y$ is a homeomorphism of
Y with a closed subspace of $Z_f$.]

9. Let $X_1 \subset X_2 \subset \cdots$ be a sequence of spaces, where $X_i$ is a closed subspace
of $X_{i+1}$ for each i. Let X be the union of the $X_i$; let us topologize X by declaring
a set U to be open in X if $U \cap X_i$ is open in $X_i$ for each i.

(a) Show that this is a topology on X and that each space $X_i$ is a subspace (in
fact, a closed subspace) of X in this topology. This topology is called the
topology coherent with the subspaces $X_i$.

(b) Show that $f : X \to Y$ is continuous if $f|X_i$ is continuous for each i.

(c) Show that if each space $X_i$ is normal, then X is normal. [Hint: Given disjoint
closed sets A and B in X, set f equal to 0 on A and 1 on B, and extend f
successively to $A \cup B \cup X_i$ for i = 1, 2, ....]


*§36 Imbeddings of Manifolds†


We have shown that every regular space with a countable basis can be imbedded in the
“infinite-dimensional” euclidean space $\mathbb{R}^\omega$. It is natural to ask under what conditions a
space X can be imbedded in some finite-dimensional euclidean space $\mathbb{R}^N$. One answer
to this question is given in this section. A more general answer will be obtained in
Chapter 8, when we study dimension theory.


†This section will be assumed when we study paracompactness in §41 and when we study dimension
theory in §50.

Definition. An m-manifold is a Hausdorff space X with a countable basis such that
each point x of X has a neighborhood that is homeomorphic with an open subset
of $\mathbb{R}^m$.

A 1-manifold is often called a curve, and a 2-manifold is called a surface.
Manifolds form a very important class of spaces; they are much studied in differential
geometry and algebraic topology. 

We shall prove that if X is a compact manifold, then X can be imbedded in a finite- 
dimensional euclidean space. The theorem holds without the assumption of compact- 
ness, but the proof is a good deal harder. 

First, we need some terminology. 

If $\phi : X \to \mathbb{R}$, then the support of $\phi$ is defined to be the closure of the set
$\phi^{-1}(\mathbb{R} - \{0\})$. Thus if x lies outside the support of $\phi$, there is some neighborhood of x
on which $\phi$ vanishes.


Definition. Let $\{U_1, \ldots, U_n\}$ be a finite indexed open covering of the space X. An
indexed family of continuous functions


$$\phi_i : X \to [0, 1] \quad \text{for } i = 1, \ldots, n,$$

is said to be a partition of unity dominated by $\{U_i\}$ if:
(1) (support $\phi_i$) $\subset U_i$ for each i.
(2) $\sum_{i=1}^{n} \phi_i(x) = 1$ for each x.


Theorem 36.1 (Existence of finite partitions of unity). Let $\{U_1, \ldots, U_n\}$ be a finite
open covering of the normal space X. Then there exists a partition of unity dominated
by $\{U_i\}$.

Proof. Step 1. First, we prove that one can “shrink” the covering $\{U_i\}$ to an open
covering $\{V_1, \ldots, V_n\}$ of X such that $\bar{V}_i \subset U_i$ for each i.

We proceed by induction. First, note that the set


$$A = X - (U_2 \cup \cdots \cup U_n)$$


is a closed subset of X. Because $\{U_1, \ldots, U_n\}$ covers X, the set A is contained in the
open set $U_1$. Using normality, choose an open set $V_1$ containing A such that $\bar{V}_1 \subset U_1$.
Then the collection $\{V_1, U_2, \ldots, U_n\}$ covers X.

In general, given open sets $V_1, \ldots, V_{k-1}$ such that the collection


$$\{V_1, \ldots, V_{k-1}, U_k, U_{k+1}, \ldots, U_n\}$$

covers X, let

$$A = X - (V_1 \cup \cdots \cup V_{k-1}) - (U_{k+1} \cup \cdots \cup U_n).$$


Then A is a closed subset of X which is contained in the open set $U_k$. Choose $V_k$ to be
an open set containing A such that $\bar{V}_k \subset U_k$. Then $\{V_1, \ldots, V_{k-1}, V_k, U_{k+1}, \ldots, U_n\}$
covers X. At the nth step of the induction, our result is proved.

Step 2. Now we prove the theorem. Given the open covering $\{U_1, \ldots, U_n\}$ of X,
choose an open covering $\{V_1, \ldots, V_n\}$ of X such that $\bar{V}_i \subset U_i$ for each i. Then choose
an open covering $\{W_1, \ldots, W_n\}$ of X such that $\bar{W}_i \subset V_i$ for each i. Using the Urysohn
lemma, choose for each i a continuous function


$$\psi_i : X \to [0, 1]$$


such that $\psi_i(\bar{W}_i) = \{1\}$ and $\psi_i(X - V_i) = \{0\}$. Since $\psi_i^{-1}(\mathbb{R} - \{0\})$ is contained in $V_i$,
we have


$$(\text{support } \psi_i) \subset \bar{V}_i \subset U_i.$$


Because the collection $\{W_i\}$ covers X, the sum $\Psi(x) = \sum_{i=1}^{n} \psi_i(x)$ is positive for
each x. Therefore, we may define, for each j,


$$\phi_j(x) = \frac{\psi_j(x)}{\Psi(x)}.$$

It is easy to check that the functions $\phi_1, \ldots, \phi_n$ form the desired partition of unity. ∎
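
The normalization in Step 2 is easy to carry out by hand on the unit interval. The following is a minimal sketch under assumptions of our own choosing: X = [0, 1], the covering is U_1 = [0, 0.7), U_2 = (0.3, 1], the shrinkings are V_1 = [0, 0.6), V_2 = (0.4, 1] and W_1 = [0, 0.55), W_2 = (0.45, 1], and piecewise-linear ramps stand in for the functions supplied by the Urysohn lemma.

```python
def ramp(x, lo, hi):
    """Continuous function equal to 0 for x <= lo, 1 for x >= hi, linear in between."""
    return min(1.0, max(0.0, (x - lo) / (hi - lo)))

def psi1(x):
    # equals 1 on the closure of W_1 = [0, 0.55) and 0 outside V_1 = [0, 0.6)
    return 1.0 - ramp(x, 0.55, 0.60)

def psi2(x):
    # equals 1 on the closure of W_2 = (0.45, 1] and 0 outside V_2 = (0.4, 1]
    return ramp(x, 0.40, 0.45)

def partition(x):
    """Return (phi_1(x), phi_2(x)), where phi_j = psi_j / (psi_1 + psi_2)."""
    total = psi1(x) + psi2(x)   # positive because W_1 and W_2 cover [0, 1]
    return psi1(x) / total, psi2(x) / total

grid = [k / 1000 for k in range(1001)]
sums = [sum(partition(x)) for x in grid]
print(max(abs(t - 1.0) for t in sums) < 1e-12)
```

Here the support of phi_1 lies in [0, 0.6], inside U_1, and the support of phi_2 lies in [0.4, 1], inside U_2, so both conditions of the definition hold for this covering.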


There is a comparable notion of partition of unity when the open covering and the
collection of functions are not finite, nor even countable. We shall consider this matter
in Chapter 6, when we study paracompactness.


Theorem 36.2. If X is a compact m-manifold, then X can be imbedded in $\mathbb{R}^N$ for
some positive integer N.


Proof. Cover X by finitely many open sets $\{U_1, \ldots, U_n\}$, each of which may be
imbedded in $\mathbb{R}^m$. Choose imbeddings $g_i : U_i \to \mathbb{R}^m$ for each i. Being compact and
Hausdorff, X is normal. Let $\phi_1, \ldots, \phi_n$ be a partition of unity dominated by $\{U_i\}$; let
$A_i$ = support $\phi_i$. For each $i = 1, \ldots, n$, define a function $h_i : X \to \mathbb{R}^m$ by the rule


$$h_i(x) = \begin{cases} \phi_i(x) \cdot g_i(x) & \text{for } x \in U_i, \\ \mathbf{0} = (0, \ldots, 0) & \text{for } x \in X - A_i. \end{cases}$$

[Here $\phi_i(x)$ is a real number c and $g_i(x)$ is a point $\mathbf{y} = (y_1, \ldots, y_m)$ of $\mathbb{R}^m$; the product
$c \cdot \mathbf{y}$ denotes of course the point $(cy_1, \ldots, cy_m)$ of $\mathbb{R}^m$.] The function $h_i$ is well defined
because the two definitions of $h_i$ agree on the intersection of their domains, and $h_i$ is
continuous because its restrictions to the open sets $U_i$ and $X - A_i$ are continuous.


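Now define

$$F : X \to (\underbrace{\mathbb{R} \times \cdots \times \mathbb{R}}_{n \text{ times}} \times \underbrace{\mathbb{R}^m \times \cdots \times \mathbb{R}^m}_{n \text{ times}})$$

by the rule


$$F(x) = (\phi_1(x), \ldots, \phi_n(x), h_1(x), \ldots, h_n(x)).$$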

Clearly, F is continuous. To prove that F is an imbedding we need only to show that
F is injective (because X is compact). Suppose that F(x) = F(y). Then $\phi_i(x) =
\phi_i(y)$ and $h_i(x) = h_i(y)$ for all i. Now $\phi_i(x) > 0$ for some i [since $\sum \phi_i(x) = 1$].
Therefore, $\phi_i(y) > 0$ also, so that $x, y \in U_i$. Then


$$\phi_i(x) \cdot g_i(x) = h_i(x) = h_i(y) = \phi_i(y) \cdot g_i(y).$$


Because $\phi_i(x) = \phi_i(y) > 0$, we conclude that $g_i(x) = g_i(y)$. But $g_i : U_i \to \mathbb{R}^m$ is
injective, so that x = y, as desired. ∎


In many applications of partitions of unity, such as the one just given, all one needs
to know is that the sum $\sum \phi_i(x)$ is positive for each x. In others, however, one needs
the stronger condition that $\sum \phi_i(x) = 1$. See §50.


Exercises 


1. Prove that every manifold is regular and hence metrizable. Where do you use the
Hausdorff condition?


2. Let X be a compact Hausdorff space. Suppose that for each $x \in X$, there is a
neighborhood U of x and a positive integer k such that U can be imbedded in $\mathbb{R}^k$.
Show that X can be imbedded in $\mathbb{R}^N$ for some positive integer N.


3. Let X be a Hausdorff space such that each point of X has a neighborhood that is
homeomorphic with an open subset of $\mathbb{R}^m$. Show that if X is compact, then X is
an m-manifold.


4. An indexed family $\{A_\alpha\}$ of subsets of X is said to be a point-finite indexed family
if each $x \in X$ belongs to $A_\alpha$ for only finitely many values of $\alpha$.
Lemma (The shrinking lemma). Let X be a normal space; let $\{U_1, U_2, \ldots\}$ be
a point-finite indexed open covering of X. Then there exists an indexed open
covering $\{V_1, V_2, \ldots\}$ of X such that $\bar{V}_n \subset U_n$ for each n.


5. The Hausdorff condition is an essential part of the definition of a manifold; it is
not implied by the other parts of the definition. Consider the following space:
Let X be the union of the set $\mathbb{R} - \{0\}$ and the two-point set {p, q}. Topologize X
by taking as basis the collection of all open intervals in R that do not contain 0,
along with all sets of the form $(-a, 0) \cup \{p\} \cup (0, a)$ and all sets of the form
$(-a, 0) \cup \{q\} \cup (0, a)$, for a > 0. The space X is called the line with two
origins.
(a) Check that this is a basis for a topology.
(b) Show that each of the spaces X − {p} and X − {q} is homeomorphic to R.
(c) Show that X satisfies the $T_1$ axiom, but is not Hausdorff.

(d) Show that X satisfies all the conditions for a 1-manifold except for the Hausdorff
condition.

*Supplementary Exercises: Review of the Basics


Consider the following properties a space may satisfy: 
(1) connected 
(2) path connected 
(3) locally connected 
(4) locally path connected 
(5) compact 
(6) limit point compact
(7) locally compact Hausdorff 
(8) Hausdorff 
(9) regular 
(10) completely regular 
(11) normal 
(12) first-countable 
(13) second-countable 
(14) Lindelöf 
(15) has a countable dense subset 
(16) locally metrizable
(17) metrizable 
1. For each of the following spaces, determine (if you can) which of these properties
it satisfies. (Assume the Tychonoff theorem if you need it.)
(a) $S_\Omega$
(b) $\bar{S}_\Omega$
(c) $S_\Omega \times \bar{S}_\Omega$
(d) The ordered square
(e) $\mathbb{R}_\ell$
(f) $\mathbb{R}_\ell^2$
(g) $\mathbb{R}^\omega$ in the product topology
(h) $\mathbb{R}^\omega$ in the uniform topology
(i) $\mathbb{R}^\omega$ in the box topology
(j) $\mathbb{R}^I$ in the product topology, where I = [0, 1]
(k) $\mathbb{R}_K$
2. Which of these properties does a metric space necessarily have?
3. Which of these properties does a compact Hausdorff space have? 
4. Which of these properties are preserved when one passes to a subspace? To a 
closed subspace? To an open subspace? 


5. Which of these properties are preserved under finite products? Countable prod- 
ucts? Arbitrary products? 
6. Which of these properties are preserved by continuous maps?
7. After studying Chapters 6 and 7, repeat Exercises 1-6 for the following properties:
(18) paracompact
(19) topologically complete
You should be able to answer all but one of the 340 questions involved in Exercises 1-6,
and all but one of the 40 questions involved in Exercise 7. These two are
unsolved; see the remark in Exercise 5 of §32.


Chapter 5 


The Tychonoff Theorem 


We now return to a problem we left unresolved in Chapter 3. We shall prove the 
Tychonoff theorem, to the effect that arbitrary products of compact spaces are compact. 
The proof makes use of Zorn’s Lemma (see §11). An alternate proof, which relies 
instead on the well-ordering theorem, is outlined in the exercises.

The Tychonoff theorem is of great usefulness to analysts (less so to geometers). 
We apply it in §38 to construct the Stone-Čech compactification of a completely regular
space, and in §47 in proving the general version of Ascoli’s theorem.


§37 The Tychonoff Theorem 


Like the Urysohn lemma, the Tychonoff theorem is what we call a “deep” theorem. Its 
proof involves not one but several original ideas; it is anything but straightforward. We 
shall discuss the crucial ideas of the proof in some detail before turning to the proof 
itself. 

In Chapter 3, we proved the product X x Y of two compact spaces to be compact. 
For that proof the open covering formulation of compactness was quite satisfactory. 
Given an open covering of X x Y by basis elements, we covered each slice x x Y by 
finitely many of them, and proceeded from that to construct a finite covering of X x Y. 

It is quite tricky to make this approach work for an arbitrary product of compact
spaces; one must well-order the index set and use transfinite induction. (See
Exercise 5.) An alternate approach is to abandon open coverings and to approach the
problem by applying the closed set formulation of compactness, using Zorn’s lemma.

To see how this idea might work, let us consider first the simplest possible case:
the product of two compact spaces $X_1 \times X_2$. Suppose that $\mathcal{A}$ is a collection of closed
subsets of $X_1 \times X_2$ that has the finite intersection property. Consider the projection
map $\pi_1 : X_1 \times X_2 \to X_1$. The collection


$$\{\pi_1(A) \mid A \in \mathcal{A}\}$$


of subsets of $X_1$ also has the finite intersection property, and so does the collection of
their closures $\overline{\pi_1(A)}$. Compactness of $X_1$ guarantees that the intersection of all the sets
$\overline{\pi_1(A)}$ is nonempty. Let us choose a point $x_1$ belonging to this intersection. Similarly,
let us choose a point $x_2$ belonging to all the sets $\overline{\pi_2(A)}$. The obvious conclusion we
would like to draw is that the point $x_1 \times x_2$ lies in $\bigcap_{A \in \mathcal{A}} A$, for then our theorem would
be proved.

But that is unfortunately not true. Consider the following example, in which $X_1 =
X_2 = [0, 1]$ and the collection $\mathcal{A}$ consists of all closed elliptical regions bounded by
ellipses that have the points $p = (\tfrac{1}{3}, \tfrac{1}{3})$ and $q = (\tfrac{1}{2}, \tfrac{2}{3})$ as their foci. See Figure 37.1.
Certainly $\mathcal{A}$ has the finite intersection property. Now let us pick a point $x_1$ in the
intersection of the sets $\{\pi_1(A) \mid A \in \mathcal{A}\}$. Any point of the interval $[\tfrac{1}{3}, \tfrac{1}{2}]$ will do;
suppose we choose $x_1 = \tfrac{1}{2}$. Similarly, choose a point $x_2$ in the intersection of the sets
$\{\pi_2(A) \mid A \in \mathcal{A}\}$. Any point of the interval $[\tfrac{1}{3}, \tfrac{2}{3}]$ will do; suppose we pick $x_2 = \tfrac{1}{2}$.
This proves to be an unfortunate choice, for the point


$$x_1 \times x_2 = \tfrac{1}{2} \times \tfrac{1}{2}$$


does not lie in the intersection of the sets A.
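
The failure of this naive choice can be checked numerically. The sketch below assumes the foci are p = (1/3, 1/3) and q = (1/2, 2/3), and it models a closed elliptical region by the condition that the two focal distances sum to at most a given total; the helper name `in_region` is ours.

```python
import math

P = (1 / 3, 1 / 3)   # assumed focus p
Q = (1 / 2, 2 / 3)   # assumed focus q

def in_region(pt, total):
    """Closed elliptical region with foci P and Q: the sum of the distances
    from pt to the two foci is at most `total`."""
    return math.dist(pt, P) + math.dist(pt, Q) <= total

focal = math.dist(P, Q)      # regions are nondegenerate only when total > focal

bad = (1 / 2, 1 / 2)         # each coordinate lies in the corresponding projection
good = Q                     # the choice x_1 = 1/2, x_2 = 2/3

thin = focal + 0.01          # a thin region hugging the segment pq
print(in_region(bad, thin), in_region(good, thin))
```

Each coordinate of `bad` is an admissible choice on its own, yet a sufficiently thin region excludes the point, whereas `good` lies on the segment pq and therefore belongs to every region of the family.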


Figure 37.1 


“Aha!” you say, “you made a bad choice. If after choosing $x_1 = \tfrac{1}{2}$ you had chosen
$x_2 = \tfrac{2}{3}$, then you would have found a point in $\bigcap_{A \in \mathcal{A}} A$.” The difficulty with our
tentative proof is that it gave us too much freedom in picking $x_1$ and $x_2$; it allowed us
to make a “bad” choice instead of a “good” choice.

How can we alter the proof so as to avoid this difficulty? 

This question leads to the second idea of the proof: Perhaps if we expand the
collection $\mathcal{A}$ (retaining the finite intersection property, of course), that expansion will
restrict the choices of $x_1$ and $x_2$ sufficiently that we will be forced to make the “right”
choice. To illustrate, suppose that in the previous example we expand the collection $\mathcal{A}$
to the collection $\mathcal{D}$ consisting of all closed elliptical regions bounded by ellipses that
have $p = (\tfrac{1}{3}, \tfrac{1}{3})$ as one focus and any point of the line segment pq as the other focus.
This collection is illustrated in Figure 37.2. The new collection $\mathcal{D}$ still has the finite
intersection property. But if you try to choose a point $x_1$ in


$$\bigcap_{D \in \mathcal{D}} \pi_1(D),$$


the only possible choice for $x_1$ is $\tfrac{1}{3}$. Similarly, the only possible choice for $x_2$ is $\tfrac{1}{3}$.
And $\tfrac{1}{3} \times \tfrac{1}{3}$ does belong to every set D, and hence to every set A. In other words,
expanding the collection $\mathcal{A}$ to the collection $\mathcal{D}$ forces the proper choice on us.




Figure 37.2 


Now of course in this example we chose $\mathcal{D}$ carefully so that the proof would work.
What hope can we have for choosing $\mathcal{D}$ correctly in general? Here is the third idea of
the proof: Why not simply choose $\mathcal{D}$ to be a collection that is “as large as possible”
(so that no larger collection has the finite intersection property) and see whether such
a $\mathcal{D}$ will work? It is not at all obvious that such a collection $\mathcal{D}$ exists; to prove it, we
must appeal to Zorn’s lemma. But after we prove that $\mathcal{D}$ exists, we shall in fact be
able to show that $\mathcal{D}$ is large enough to force the proper choices on us.

A final remark. The assumption that the elements of the collection $\mathcal{A}$ were closed
sets was irrelevant in this discussion. For even if the set $A \in \mathcal{A}$ is closed, the set $\pi_1(A)$
need not be closed, so we had to take its closure in order to apply the closed set
formulation of compactness. Therefore, we may as well begin with an arbitrary collection
of subsets of X having the finite intersection property, and prove that the intersection
of their closures is nonempty. This approach actually proves to be more convenient.


Lemma 37.1. Let X be a set; let $\mathcal{A}$ be a collection of subsets of X having the
finite intersection property. Then there is a collection $\mathcal{D}$ of subsets of X such that $\mathcal{D}$
contains $\mathcal{A}$, and $\mathcal{D}$ has the finite intersection property, and no collection of subsets
of X that properly contains $\mathcal{D}$ has this property.


We often say that a collection $\mathcal{D}$ satisfying the conclusion of this lemma is maximal
with respect to the finite intersection property.

Proof. As you might expect, we construct $\mathcal{D}$ by using Zorn’s lemma. It states that,
given a set A that is strictly partially ordered, in which every simply ordered subset
has an upper bound, A itself has a maximal element.

The set A to which we shall apply Zorn’s lemma is not a subset of X, nor even a
collection of subsets of X, but a set whose elements are collections of subsets of X.
For purposes of this proof, we shall call a set whose elements are collections of subsets
of X a “superset” and shall denote it by an outline letter. To summarize the notation:

c is an element of X.

C is a subset of X.

$\mathcal{C}$ is a collection of subsets of X.

$\mathbb{C}$ is a superset whose elements are collections of subsets of X.

Now by hypothesis, we have a collection $\mathcal{A}$ of subsets of X that has the finite
intersection property. Let $\mathbb{A}$ denote the superset consisting of all collections $\mathcal{B}$ of
subsets of X such that $\mathcal{B} \supset \mathcal{A}$ and $\mathcal{B}$ has the finite intersection property. We use
proper inclusion $\subsetneq$ as our strict partial order on $\mathbb{A}$. To prove our lemma, we need to
show that $\mathbb{A}$ has a maximal element $\mathcal{D}$.

In order to apply Zorn’s lemma, we must show that if $\mathbb{B}$ is a “subsuperset” of $\mathbb{A}$
that is simply ordered by proper inclusion, then $\mathbb{B}$ has an upper bound in $\mathbb{A}$. We shall
show in fact that the collection

$$\mathcal{C} = \bigcup_{\mathcal{B} \in \mathbb{B}} \mathcal{B},$$


which is the union of the collections belonging to $\mathbb{B}$, is an element of $\mathbb{A}$; then it is the
required upper bound on $\mathbb{B}$.

To show that $\mathcal{C}$ is an element of $\mathbb{A}$, we must show that $\mathcal{C} \supset \mathcal{A}$ and that $\mathcal{C}$ has
the finite intersection property. Certainly $\mathcal{C}$ contains $\mathcal{A}$, since each element of $\mathbb{B}$
contains $\mathcal{A}$. To show that $\mathcal{C}$ has the finite intersection property, let $C_1, \ldots, C_n$ be elements
of $\mathcal{C}$. Because $\mathcal{C}$ is the union of the elements of $\mathbb{B}$, there is, for each i, an element $\mathcal{B}_i$
of $\mathbb{B}$ such that $C_i \in \mathcal{B}_i$. The superset $\{\mathcal{B}_1, \ldots, \mathcal{B}_n\}$ is contained in $\mathbb{B}$, so it is simply
ordered by the relation of proper inclusion. Being finite, it has a largest element; that
is, there is an index k such that $\mathcal{B}_i \subset \mathcal{B}_k$ for $i = 1, \ldots, n$. Then all the sets $C_1, \ldots, C_n$
are elements of $\mathcal{B}_k$. Since $\mathcal{B}_k$ has the finite intersection property, the intersection of
the sets $C_1, \ldots, C_n$ is nonempty, as desired. ∎

Lemma 37.2. Let X be a set; let $\mathcal{D}$ be a collection of subsets of X that is maximal
with respect to the finite intersection property. Then:
(a) Any finite intersection of elements of $\mathcal{D}$ is an element of $\mathcal{D}$.
(b) If A is a subset of X that intersects every element of $\mathcal{D}$, then A is an element
of $\mathcal{D}$.


Proof. (a) Let B equal the intersection of finitely many elements of $\mathcal{D}$. Define a
collection $\mathcal{E}$ by adjoining B to $\mathcal{D}$, so that $\mathcal{E} = \mathcal{D} \cup \{B\}$. We show that $\mathcal{E}$ has the finite
intersection property; then maximality of $\mathcal{D}$ implies that $\mathcal{E} = \mathcal{D}$, so that $B \in \mathcal{D}$ as
desired.

Take finitely many elements of $\mathcal{E}$. If none of them is the set B, then their intersection
is nonempty because $\mathcal{D}$ has the finite intersection property. If one of them is the
set B, then their intersection is of the form


$$D_1 \cap \cdots \cap D_m \cap B.$$


Since B equals a finite intersection of elements of $\mathcal{D}$, this set is nonempty.

(b) Given A, define $\mathcal{E} = \mathcal{D} \cup \{A\}$. We show that $\mathcal{E}$ has the finite intersection property,
from which we conclude that A belongs to $\mathcal{D}$. Take finitely many elements of $\mathcal{E}$.
If none of them is the set A, their intersection is automatically nonempty. Otherwise,
it is of the form


$$D_1 \cap \cdots \cap D_n \cap A.$$


Now $D_1 \cap \cdots \cap D_n$ belongs to $\mathcal{D}$, by (a); therefore, this intersection is nonempty, by
hypothesis. ∎
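
On a finite set no appeal to Zorn's lemma is needed, and both lemmas can be verified by brute force. The following sketch is an illustration under that finiteness assumption only; the greedy procedure `maximal_extension` is our own device and is not the argument of Lemma 37.1.

```python
from itertools import combinations

def powerset(X):
    """All subsets of the finite set X, as frozensets."""
    items = list(X)
    return [frozenset(c) for k in range(len(items) + 1) for c in combinations(items, k)]

def has_fip(collection):
    """Finite intersection property. For a finite collection it suffices that the
    intersection of all members is nonempty, since every subcollection's
    intersection contains it."""
    sets = list(collection)
    return True if not sets else bool(frozenset.intersection(*sets))

def maximal_extension(X, A):
    """Adjoin subsets of X one at a time whenever the property survives.
    A set rejected at some stage stays rejected later, because the collection
    only grows, so the result is maximal."""
    D = set(A)
    for S in powerset(X):
        if has_fip(D | {S}):
            D.add(S)
    return D

X = {0, 1, 2}
A = [frozenset({0, 1}), frozenset({1, 2})]
D = maximal_extension(X, A)

closed_under_meets = all((S & T) in D for S in D for T in D)          # Lemma 37.2(a)
absorbs_meeting_sets = all(S in D for S in powerset(X)
                           if all(S & T for T in D))                  # Lemma 37.2(b)
print(sorted(sorted(S) for S in D), closed_under_meets, absorbs_meeting_sets)
```

In this example the maximal collection consists of exactly the subsets containing the point 1, which is the finite analogue of the way a maximal collection forces the choice of a point in the proof that follows.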


Theorem 37.3 (Tychonoff theorem). An arbitrary product of compact spaces is
compact in the product topology.


Proof. Let


$$X = \prod_{\alpha \in J} X_\alpha,$$

where each space $X_\alpha$ is compact. Let $\mathcal{A}$ be a collection of subsets of X having the
finite intersection property. We prove that the intersection

$$\bigcap_{A \in \mathcal{A}} \bar{A}$$

is nonempty. Compactness of X follows.

Applying Lemma 37.1, choose a collection $\mathcal{D}$ of subsets of X such that $\mathcal{D} \supset \mathcal{A}$
and $\mathcal{D}$ is maximal with respect to the finite intersection property. It will suffice to
show that the intersection $\bigcap_{D \in \mathcal{D}} \bar{D}$ is nonempty.

Given $\alpha \in J$, let $\pi_\alpha : X \to X_\alpha$ be the projection map, as usual. Consider the
collection


$$\{\pi_\alpha(D) \mid D \in \mathcal{D}\}$$

of subsets of $X_\alpha$. This collection has the finite intersection property because $\mathcal{D}$ does.
By compactness of $X_\alpha$, we can for each $\alpha$ choose a point $x_\alpha$ of $X_\alpha$ such that


$$x_\alpha \in \bigcap_{D \in \mathcal{D}} \overline{\pi_\alpha(D)}.$$


Let $\mathbf{x}$ be the point $(x_\alpha)_{\alpha \in J}$ of X. We shall show that $\mathbf{x} \in \bar{D}$ for every $D \in \mathcal{D}$; then our
proof will be finished.

First we show that if $\pi_\beta^{-1}(U_\beta)$ is any subbasis element (for the product topology
on $X$) containing $x$, then $\pi_\beta^{-1}(U_\beta)$ intersects every element of $\mathcal{D}$. The set $U_\beta$ is a
neighborhood of $x_\beta$ in $X_\beta$. Since $x_\beta \in \overline{\pi_\beta(D)}$ by definition, $U_\beta$ intersects $\pi_\beta(D)$ in
some point $\pi_\beta(y)$, where $y \in D$. Then it follows that $y \in \pi_\beta^{-1}(U_\beta) \cap D$.

It follows from (b) of Lemma 37.2 that every subbasis element containing $x$ belongs
to $\mathcal{D}$. And then it follows from (a) of the same lemma that every basis element
containing $x$ belongs to $\mathcal{D}$. Since $\mathcal{D}$ has the finite intersection property, this means
that every basis element containing $x$ intersects every element of $\mathcal{D}$; hence $x \in \bar{D}$ for
every $D \in \mathcal{D}$, as desired. ∎


Exercises 


1. Let $X$ be a space. Let $\mathcal{D}$ be a collection of subsets of $X$ that is maximal with
respect to the finite intersection property.
(a) Show that $x \in \bar{D}$ for every $D \in \mathcal{D}$ if and only if every neighborhood of $x$
belongs to $\mathcal{D}$. Which implication uses maximality of $\mathcal{D}$?
(b) Let $D \in \mathcal{D}$. Show that if $A \supset D$, then $A \in \mathcal{D}$.
(c) Show that if $X$ satisfies the $T_1$ axiom, there is at most one point belonging
to $\bigcap_{D \in \mathcal{D}} D$.


2. A collection $\mathcal{A}$ of subsets of $X$ has the countable intersection property if every
countable intersection of elements of $\mathcal{A}$ is nonempty. Show that $X$ is a Lindelöf
space if and only if for every collection $\mathcal{A}$ of subsets of $X$ having the countable
intersection property,

$$\bigcap_{A \in \mathcal{A}} \bar{A}$$

is nonempty.


3. Consider the three statements:
(i) If $X$ is a set and $\mathcal{A}$ is a collection of subsets of $X$ having the countable
intersection property, then there is a collection $\mathcal{D}$ of subsets of $X$
such that $\mathcal{D} \supset \mathcal{A}$ and $\mathcal{D}$ is maximal with respect to the countable
intersection property.
(ii) Suppose $\mathcal{D}$ is maximal with respect to the countable intersection property.
Then countable intersections of elements of $\mathcal{D}$ are in $\mathcal{D}$. Furthermore,
if $A$ is a subset of $X$ that intersects every element of $\mathcal{D}$, then $A$
is an element of $\mathcal{D}$.
(iii) Products of Lindelöf spaces are Lindelöf.
(a) Show that (i) and (ii) together imply (iii).
(b) Show that (ii) holds.
(c) Products of Lindelöf spaces need not be Lindelöf (see §30). Therefore (i)
does not hold. If one attempts to generalize the proof of Lemma 37.1 to the
countable intersection property, at what point does the proof break down?


4. Here is another theorem whose proof uses Zorn's lemma. Recall that if $A$ is a
space and if $x, y \in A$, we say that $x$ and $y$ belong to the same quasicomponent
of $A$ if there is no separation $A = C \cup D$ of $A$ into two disjoint sets open in $A$
such that $x \in C$ and $y \in D$.

Theorem. Let $X$ be a compact Hausdorff space. Then $x$ and $y$ belong to the
same quasicomponent of $X$ if and only if they belong to the same component
of $X$.

(a) Let $\mathcal{A}$ be the collection of all closed subspaces $A$ of $X$ such that $x$ and $y$ lie in
the same quasicomponent of $A$. Let $\mathcal{B}$ be a subcollection of $\mathcal{A}$ that is simply
ordered by proper inclusion. Show that the intersection of the elements of $\mathcal{B}$
belongs to $\mathcal{A}$. [Hint: Compare Exercise 11 of §26.]
(b) Show $\mathcal{A}$ has a minimal element $D$.
(c) Show $D$ is connected.

5. Here is a proof of the Tychonoff theorem that relies on the well-ordering theorem
rather than on Zorn's lemma. First, prove the following version of the tube
lemma; then prove the theorem.

Lemma. Let $\mathcal{A}$ be a collection of basis elements for the topology of the product
space $X \times Y$, such that no finite subcollection of $\mathcal{A}$ covers $X \times Y$. If $X$ is
compact, there is a point $x \in X$ such that no finite subcollection of $\mathcal{A}$ covers the
slice $\{x\} \times Y$.

Theorem. An arbitrary product of compact spaces is compact in the product
topology.

Proof. Let $\{X_\alpha\}_{\alpha \in J}$ be an indexed family of compact spaces; let

$$X = \prod_{\alpha \in J} X_\alpha.$$

Let $\pi_\alpha : X \to X_\alpha$ be the projection map. Well-order $J$, once and for all, in such
a way that $J$ has a largest element.

(a) Let $\beta \in J$. Suppose points $p_i \in X_i$ are given, for all $i < \beta$. For any $\alpha < \beta$,
let $Y_\alpha$ denote the subspace of $X$ defined by the equation

$$Y_\alpha = \{x \mid \pi_i(x) = p_i \text{ for } i \le \alpha\}.$$

Note that if $\alpha < \alpha'$, then $Y_\alpha \supset Y_{\alpha'}$. Show that if $\mathcal{A}$ is a finite collection of
basis elements for $X$ that covers the space

$$Z_\beta = \bigcap_{\alpha < \beta} Y_\alpha = \{x \mid \pi_i(x) = p_i \text{ for } i < \beta\},$$

then $\mathcal{A}$ actually covers $Y_\alpha$ for some $\alpha < \beta$. [Hint: If $\beta$ has an immediate
predecessor in $J$, let $\alpha$ be that immediate predecessor. Otherwise, for each
$A \in \mathcal{A}$, let $J_A$ denote the set of those indices $i < \beta$ for which $\pi_i(A) \ne X_i$;
the union of the sets $J_A$, for $A \in \mathcal{A}$, is finite; let $\alpha$ be the largest element of
this union.]

(b) Assume $\mathcal{A}$ is a collection of basis elements for $X$ such that no finite
subcollection of $\mathcal{A}$ covers $X$. Show that one can choose points $p_i \in X_i$ for all $i$,
such that for each $\alpha$, the space $Y_\alpha$ defined in (a) cannot be finitely covered
by $\mathcal{A}$. When $\alpha$ is the largest element of $J$, one has a contradiction. [Hint: If
$\alpha$ is the smallest element of $J$, use the preceding lemma to choose $p_\alpha$. If $p_i$
is defined for all $i < \beta$, note that (a) implies that the space $Z_\beta$ cannot be
finitely covered by $\mathcal{A}$, and use the lemma to find $p_\beta$.]


§38 The Stone-Čech Compactification


We have already studied one way of compactifying a topological space $X$, the one-point
compactification (§29); it is in some sense the minimal compactification of $X$.
The Stone-Čech compactification of $X$, which we study now, is in some sense the
maximal compactification of $X$. It was constructed by M. Stone and E. Čech,
independently, in 1937. It has a number of applications in modern analysis, but these lie
outside the scope of this book.

We recall the following definition: 


Definition. A compactification of a space $X$ is a compact Hausdorff space $Y$
containing $X$ as a subspace such that $\bar{X} = Y$. Two compactifications $Y_1$ and $Y_2$ of $X$ are
said to be equivalent if there is a homeomorphism $h : Y_1 \to Y_2$ such that $h(x) = x$
for every $x \in X$.


If $X$ has a compactification $Y$, then $X$ must be completely regular, being a subspace
of the completely regular space $Y$. Conversely, if $X$ is completely regular, then
$X$ has a compactification. For $X$ can be imbedded in the compact Hausdorff space
$[0, 1]^J$ for some $J$, and any such imbedding gives rise to a compactification of $X$, as
the following lemma shows:


Lemma 38.1. Let $X$ be a space; suppose that $h : X \to Z$ is an imbedding of $X$ in
the compact Hausdorff space $Z$. Then there exists a corresponding compactification $Y$
of $X$; it has the property that there is an imbedding $H : Y \to Z$ that equals $h$ on $X$.
The compactification $Y$ is uniquely determined up to equivalence.

We call $Y$ the compactification induced by the imbedding $h$.


Proof. Given $h$, let $X_0$ denote the subspace $h(X)$ of $Z$, and let $Y_0$ denote its closure
in $Z$. Then $Y_0$ is a compact Hausdorff space and $\bar{X}_0 = Y_0$; therefore, $Y_0$ is a
compactification of $X_0$.

We now construct a space $Y$ containing $X$ such that the pair $(X, Y)$ is homeomorphic
to the pair $(X_0, Y_0)$. Let us choose a set $A$ disjoint from $X$ that is in bijective
correspondence with the set $Y_0 - X_0$ under some map $k : A \to Y_0 - X_0$. Define
$Y = X \cup A$, and define a bijective correspondence $H : Y \to Y_0$ by the rule

$$H(x) = h(x) \text{ for } x \in X,$$
$$H(a) = k(a) \text{ for } a \in A.$$

Then topologize $Y$ by declaring $U$ to be open in $Y$ if and only if $H(U)$ is open in $Y_0$.
The map $H$ is automatically a homeomorphism; and the space $X$ is a subspace of $Y$
because $H$ equals the homeomorphism $h$ when restricted to the subspace $X$ of $Y$. By
expanding the range of $H$, we obtain the required imbedding of $Y$ into $Z$.

Now suppose $Y_i$ is a compactification of $X$ and that $H_i : Y_i \to Z$ is an imbedding
that is an extension of $h$, for $i = 1, 2$. Now $H_i$ maps $X$ onto $h(X) = X_0$. Because
$H_i$ is continuous, it must map $Y_i$ into $\bar{X}_0$; because $H_i(Y_i)$ contains $X_0$ and is closed
(being compact), it contains $\bar{X}_0$. Hence $H_i(Y_i) = \bar{X}_0$, and $H_2^{-1} \circ H_1$ defines a
homeomorphism of $Y_1$ with $Y_2$ that equals the identity on $X$. ∎


In general, there are many different ways of compactifying a given space $X$. Consider
for instance the following compactifications of the open interval $X = (0, 1)$:

EXAMPLE 1. Take the unit circle $S^1$ in $\mathbb{R}^2$ and let $h : (0, 1) \to S^1$ be the map

$$h(t) = (\cos 2\pi t) \times (\sin 2\pi t).$$

The compactification induced by the imbedding $h$ is equivalent to the one-point
compactification of $X$.


EXAMPLE 2. Let $Y$ be the space $[0, 1]$. Then $Y$ is a compactification of $X$; it is obtained
by "adding one point at each end of $(0, 1)$."


EXAMPLE 3. Consider the square $[-1, 1]^2$ in $\mathbb{R}^2$ and let $h : (0, 1) \to [-1, 1]^2$ be the
map

$$h(x) = x \times \sin(1/x).$$

The space $Y_0 = \overline{h(X)}$ is the topologist's sine curve (see Example 7 of §24). The imbedding
$h$ gives rise to a compactification of $(0, 1)$ quite different from the other two. It is
obtained by adding one point at the right-hand end of $(0, 1)$, and an entire line segment of
points at the left-hand end!
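
To see numerically why a whole segment appears at the left-hand end in Example 3, the following Python sketch produces, for a chosen height in [-1, 1], points of the image of h whose first coordinates shrink toward 0 while the second coordinate stays at that height. The helper name points_near_left_end and its parameters are illustrative assumptions; the computation is a floating-point illustration, not a proof.

```python
import math

def h(x):
    """The imbedding h(x) = (x, sin(1/x)) of (0, 1) into the plane."""
    return (x, math.sin(1.0 / x))

def points_near_left_end(target, count=5):
    """Points h(x_k) with x_k decreasing toward 0 and sin(1/x_k) = target.

    Uses the solutions 1/x = asin(target) + 2*pi*k, k = 1, 2, ..., of
    sin(1/x) = target; `target` must lie in [-1, 1].
    """
    base = math.asin(target)
    return [h(1.0 / (base + 2.0 * math.pi * k)) for k in range(1, count + 1)]

if __name__ == "__main__":
    for target in (-1.0, 0.0, 0.5, 1.0):
        print(target, points_near_left_end(target, count=3))
    # Near the right-hand end the curve approaches the single point (1, sin 1).
    print(h(1.0 - 1e-9), math.sin(1.0))
```

Every height in [-1, 1] is approached along the curve as x tends to 0, whereas only the single point (1, sin 1) is approached as x tends to 1.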


A basic problem that occurs in studying compactifications is the following: 


If $Y$ is a compactification of $X$, under what conditions can a continuous
real-valued function $f$ defined on $X$ be extended continuously to $Y$?

The function $f$ will have to be bounded if it is to be extendable, since its extension
will carry the compact space $Y$ into $\mathbb{R}$ and will thus be bounded. But boundedness is
not enough, in general. Consider the following example:
EXAMPLE 4. Let $X = (0, 1)$. Consider the one-point compactification of $X$ given
in Example 1. A bounded continuous function $f : (0, 1) \to \mathbb{R}$ is extendable to this
compactification if and only if the limits

$$\lim_{x \to 0+} f(x) \quad\text{and}\quad \lim_{x \to 1-} f(x)$$

exist and are equal.

For the "two-point compactification" of $X$ considered in Example 2, the function $f$
is extendable if and only if both these limits simply exist.

For the compactification of Example 3, extensions exist for a still broader class of
functions. It is easy to see that $f$ is extendable if both the above limits exist. But the
function $f(x) = \sin(1/x)$ is also extendable to this compactification: Let $H$ be the imbedding
of $Y$ in $\mathbb{R}^2$ that equals $h$ on the subspace $X$. Then the composite map

$$Y \xrightarrow{\;H\;} \mathbb{R} \times \mathbb{R} \xrightarrow{\;\pi_2\;} \mathbb{R}$$

is the desired extension of $f$. For if $x \in X$, then $H(x) = h(x) = x \times \sin(1/x)$, so that
$\pi_2(H(x)) = \sin(1/x)$, as desired.

There is something especially interesting about this last compactification. We
constructed it by choosing an imbedding

$$h : (0, 1) \to \mathbb{R}^2$$

whose component functions were the functions $x$ and $\sin(1/x)$. Then we found that
both the functions $x$ and $\sin(1/x)$ could be extended to the compactification. This
suggests that if we have a whole collection of bounded continuous functions defined
on $(0, 1)$, we might use them as component functions of an imbedding of $(0, 1)$ into $\mathbb{R}^J$
for some $J$, and thereby obtain a compactification for which every function in the
collection is extendable.

This is the basic idea behind the Stone-Čech compactification. It is defined as
follows:


Theorem 38.2. Let $X$ be a completely regular space. There exists a compactification
$Y$ of $X$ having the property that every bounded continuous map $f : X \to \mathbb{R}$
extends uniquely to a continuous map of $Y$ into $\mathbb{R}$.


Proof. Let $\{f_\alpha\}_{\alpha \in J}$ be the collection of all bounded continuous real-valued functions
on $X$, indexed by some index set $J$. For each $\alpha \in J$, choose a closed interval $I_\alpha$ in $\mathbb{R}$
containing $f_\alpha(X)$. To be definite, choose

$$I_\alpha = [\inf f_\alpha(X), \sup f_\alpha(X)].$$

Then define $h : X \to \prod_{\alpha \in J} I_\alpha$ by the rule

$$h(x) = (f_\alpha(x))_{\alpha \in J}.$$

By the Tychonoff theorem, $\prod I_\alpha$ is compact. Because $X$ is completely regular, the
collection $\{f_\alpha\}$ separates points from closed sets in $X$. Therefore, by Theorem 34.2,
the map $h$ is an imbedding.

Let $Y$ be the compactification of $X$ induced by the imbedding $h$. Then there is
an imbedding $H : Y \to \prod I_\alpha$ that equals $h$ when restricted to the subspace $X$ of $Y$.
Given a bounded continuous real-valued function $f$ on $X$, we show it extends to $Y$.
The function $f$ belongs to the collection $\{f_\alpha\}_{\alpha \in J}$, so it equals $f_\beta$ for some index $\beta$.
Let $\pi_\beta : \prod I_\alpha \to I_\beta$ be the projection mapping. Then the continuous map $\pi_\beta \circ H :
Y \to I_\beta$ is the desired extension of $f$. For if $x \in X$, we have

$$\pi_\beta(H(x)) = \pi_\beta(h(x)) = \pi_\beta((f_\alpha(x))_{\alpha \in J}) = f_\beta(x).$$

Uniqueness of the extension is a consequence of the following lemma. ∎
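
The evaluation map used in the proof can be sketched in Python for a small family of functions on (0, 1). The three functions, their labels, and the dictionary representation of a point of the product are all illustrative assumptions; the actual proof uses the collection of all bounded continuous real-valued functions, which no finite family can replace.

```python
import math

# Hypothetical finite family standing in for the full collection {f_alpha}.
FAMILY = {
    "id": lambda x: x,
    "sin_recip": lambda x: math.sin(1.0 / x),
    "bump": lambda x: x * (1.0 - x),
}

# I_alpha = [inf f_alpha(X), sup f_alpha(X)] for X = (0, 1), worked out by hand.
INTERVALS = {
    "id": (0.0, 1.0),
    "sin_recip": (-1.0, 1.0),
    "bump": (0.0, 0.25),
}

def h(x):
    """The map h(x) = (f_alpha(x))_alpha, stored as a dict indexed by alpha."""
    return {alpha: f(x) for alpha, f in FAMILY.items()}

def projection(beta, point):
    """The projection pi_beta of the product onto its beta-th factor."""
    return point[beta]

if __name__ == "__main__":
    x = 0.3
    point = h(x)
    for beta, f in FAMILY.items():
        # pi_beta(h(x)) recovers f_beta(x), as in the displayed equation.
        print(beta, projection(beta, point), f(x))
```

The point of the construction is visible even in this toy case: each function in the family is recovered from h by composing with a projection, which is a continuous map defined on the whole product.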


Lemma 38.3. Let $A \subset X$; let $f : A \to Z$ be a continuous map of $A$ into the
Hausdorff space $Z$. There is at most one extension of $f$ to a continuous function
$g : \bar{A} \to Z$.


Proof. This lemma was given as an exercise in §18; we give a proof here. Suppose
that $g, g' : \bar{A} \to Z$ are two different extensions of $f$; choose $x$ so that $g(x) \ne g'(x)$.
Let $U$ and $U'$ be disjoint neighborhoods of $g(x)$ and $g'(x)$, respectively. Choose a
neighborhood $V$ of $x$ so that $g(V) \subset U$ and $g'(V) \subset U'$. Now $V$ intersects $A$ in some
point $y$; then $g(y) \in U$ and $g'(y) \in U'$. But since $y \in A$, we have $g(y) = f(y)$ and
$g'(y) = f(y)$. This contradicts the fact that $U$ and $U'$ are disjoint. ∎


Theorem 38.4. Let $X$ be a completely regular space; let $Y$ be a compactification
of $X$ satisfying the extension property of Theorem 38.2. Given any continuous map
$f : X \to C$ of $X$ into a compact Hausdorff space $C$, the map $f$ extends uniquely to a
continuous map $g : Y \to C$.


Proof. Note that $C$ is completely regular, so that it can be imbedded in $[0, 1]^J$ for
some $J$. So we may as well assume that $C \subset [0, 1]^J$. Then each component function
$f_\alpha$ of the map $f$ is a bounded continuous real-valued function on $X$; by hypothesis, $f_\alpha$
can be extended to a continuous map $g_\alpha$ of $Y$ into $\mathbb{R}$. Define $g : Y \to \mathbb{R}^J$ by setting
$g(y) = (g_\alpha(y))_{\alpha \in J}$; then $g$ is continuous because $\mathbb{R}^J$ has the product topology. Now
in fact $g$ maps $Y$ into the subspace $C$ of $\mathbb{R}^J$. For continuity of $g$ implies that

$$g(Y) = g(\bar{X}) \subset \overline{g(X)} = \overline{f(X)} \subset \bar{C} = C.$$

Thus $g$ is the desired extension of $f$. ∎

Theorem 38.5. Let $X$ be a completely regular space. If $Y_1$ and $Y_2$ are two
compactifications of $X$ satisfying the extension property of Theorem 38.2, then $Y_1$ and $Y_2$ are
equivalent.

Proof. Consider the inclusion mapping $j_2 : X \to Y_2$. It is a continuous map of $X$
into the compact Hausdorff space $Y_2$. Because $Y_1$ has the extension property, we may,
by the preceding theorem, extend $j_2$ to a continuous map $f_2 : Y_1 \to Y_2$. Similarly,
we may extend the inclusion map $j_1 : X \to Y_1$ to a continuous map $f_1 : Y_2 \to Y_1$
(because $Y_2$ has the extension property and $Y_1$ is compact Hausdorff).

[Diagrams: the inclusion $X \subset Y_1$ with $j_2 : X \to Y_2$ and $f_2 : Y_1 \to Y_2$; the inclusion
$X \subset Y_2$ with $j_1 : X \to Y_1$ and $f_1 : Y_2 \to Y_1$.]

The composite $f_1 \circ f_2 : Y_1 \to Y_1$ has the property that for every $x \in X$, one has
$f_1(f_2(x)) = x$. Therefore, $f_1 \circ f_2$ is a continuous extension of the identity map
$i_X : X \to X$. But the identity map of $Y_1$ is also a continuous extension of $i_X$. By
uniqueness of extensions (Lemma 38.3), $f_1 \circ f_2$ must equal the identity map of $Y_1$.
Similarly, $f_2 \circ f_1$ must equal the identity map of $Y_2$. Thus $f_1$ and $f_2$ are
homeomorphisms. ∎


Definition. For each completely regular space $X$, let us choose, once and for all,
a compactification of $X$ satisfying the extension condition of Theorem 38.2. We will
denote this compactification of $X$ by $\beta(X)$ and call it the Stone-Čech compactification
of $X$. It is characterized by the fact that any continuous map $f : X \to C$ of $X$ into a
compact Hausdorff space $C$ extends uniquely to a continuous map $g : \beta(X) \to C$.


Exercises 


1. Verify the statements made in Example 4. 


2. Show that the bounded continuous function $g : (0, 1) \to \mathbb{R}$ defined by $g(x) =
\cos(1/x)$ cannot be extended to the compactification of Example 3. Define an
imbedding $h : (0, 1) \to [0, 1]^3$ such that the functions $x$, $\sin(1/x)$, and $\cos(1/x)$
are all extendable to the compactification induced by $h$.

3. Under what conditions does a metrizable space have a metrizable compactifica- 
tion? 

4. Let $Y$ be an arbitrary compactification of $X$; let $\beta(X)$ be the Stone-Čech
compactification. Show there is a continuous surjective closed map $g : \beta(X) \to Y$
that equals the identity on $X$.

[This exercise makes precise what we mean by saying that $\beta(X)$ is the "maximal"
compactification of $X$. It shows that every compactification of $X$ is equivalent
to a quotient space of $\beta(X)$.]

5. (a) Show that every continuous real-valued function defined on $S_\Omega$ is "eventually
constant." [Hint: First prove that for each $\epsilon$, there is an element $\alpha$ of $S_\Omega$
such that $|f(\beta) - f(\alpha)| < \epsilon$ for all $\beta > \alpha$. Then let $\epsilon = 1/n$ for $n \in \mathbb{Z}_+$
and consider the corresponding points $\alpha_n$.]

(b) Show that the one-point compactification of $S_\Omega$ and the Stone-Čech
compactification are equivalent.

(c) Conclude that every compactification of $S_\Omega$ is equivalent to the one-point
compactification.


6. Let $X$ be completely regular. Show that $X$ is connected if and only if $\beta(X)$ is
connected. [Hint: If $X = A \cup B$ is a separation of $X$, let $f(x) = 0$ for $x \in A$
and $f(x) = 1$ for $x \in B$.]


7. Let $X$ be a discrete space; consider the space $\beta(X)$.
(a) Show that if $A \subset X$, then $\bar{A}$ and $\overline{X - A}$ are disjoint, where the closures are
taken in $\beta(X)$.
(b) Show that if $U$ is open in $\beta(X)$, then $\bar{U}$ is open in $\beta(X)$.
(c) Show that $\beta(X)$ is totally disconnected.


8. Show that $\beta(\mathbb{Z}_+)$ has cardinality at least as great as $I^I$, where $I = [0, 1]$. [Hint:
The space $I^I$ has a countable dense subset.]


9. (a) If $X$ is normal and $y$ is a point of $\beta(X) - X$, show that $y$ is not the limit of
a sequence of points of $X$.
(b) Show that if $X$ is completely regular and noncompact, then $\beta(X)$ is not
metrizable.


10. We have constructed a correspondence $X \to \beta(X)$ that assigns, to each completely
regular space, its Stone-Čech compactification. Now let us assign, to each
continuous map $f : X \to Y$ of completely regular spaces, the unique continuous
map $\beta(f) : \beta(X) \to \beta(Y)$ that extends the map $i \circ f$, where $i : Y \to \beta(Y)$ is
the inclusion map. Verify the following:
(i) If $1_X : X \to X$ is the identity map of $X$, then $\beta(1_X)$ is the identity
map of $\beta(X)$.
(ii) If $f : X \to Y$ and $g : Y \to Z$, then $\beta(g \circ f) = \beta(g) \circ \beta(f)$.
These properties tell us that the correspondence we have constructed is what is
called a functor; it is a functor from the "category" of completely regular spaces
and continuous maps of such spaces, to the "category" of compact Hausdorff
spaces and continuous maps of such spaces. You will see these properties again
in Part II of the book; they are fundamental in algebra and in algebraic topology.


Chapter 6 


Metrization Theorems 
and Paracompactness 


The Urysohn metrization theorem of Chapter 4 was the first step, a giant one, toward
an answer to the question: When is a topological space metrizable? It gives conditions
under which a space $X$ is metrizable: that it be regular and have a countable basis. But
mathematicians are never satisfied with a theorem if there is some hope of proving a
stronger one. In the present case, one can hope to strengthen the theorem by finding
conditions on $X$ that are both necessary and sufficient for $X$ to be metrizable, that is,
conditions that are equivalent to metrizability.

We know that the regularity hypothesis in the Urysohn metrization theorem is a
necessary one, but the countable basis condition is not. So the obvious thing to do is try
to replace the countable basis condition by something weaker. Finding such a condition
is a delicate task. The condition has to be strong enough to imply metrizability, and yet
weak enough that all metrizable spaces satisfy it. In a situation like this, discovering
the right hypothesis is more than half the battle.

The condition that was eventually formulated, by J. Nagata and Y. Smirnov
independently, involves a new notion, that of local finiteness. We say that a collection $\mathcal{A}$
of subsets of a space $X$ is locally finite if every point of $X$ has a neighborhood that
intersects only finitely many elements of $\mathcal{A}$.

Now one way of expressing the condition that the basis $\mathcal{B}$ is countable is to say
that $\mathcal{B}$ can be expressed in the form

$$\mathcal{B} = \bigcup_{n \in \mathbb{Z}_+} \mathcal{B}_n,$$

where each collection $\mathcal{B}_n$ is finite. This is an awkward way of saying that $\mathcal{B}$ is countable,
but it suggests how to formulate a weaker version of it. The Nagata-Smirnov
condition is to require that the basis $\mathcal{B}$ can be expressed in the form

$$\mathcal{B} = \bigcup_{n \in \mathbb{Z}_+} \mathcal{B}_n,$$

where each collection $\mathcal{B}_n$ is locally finite. We say that such a collection $\mathcal{B}$ is countably
locally finite. Surprisingly enough, this condition, along with regularity, is both
necessary and sufficient for metrizability of $X$. This we shall prove.

There is another concept in topology that involves the notion of local finiteness. It
is a generalization of the concept of compactness called "paracompactness." Although
of fairly recent origin, it has proved useful in many parts of mathematics. We introduce
it here so that we can give another set of necessary and sufficient conditions for a
space $X$ to be metrizable. It turns out that $X$ is metrizable if and only if it is both
paracompact and locally metrizable. This we prove in §42.

Some of the sections of this chapter are independent of one another. The
dependence among them is expressed in the following diagram:


§39 Local finiteness
§40 The Nagata-Smirnov metrization theorem
§41 Paracompactness
§42 The Smirnov metrization theorem

§39 Local Finiteness 


In this section we prove some elementary properties of locally finite collections and
a crucial lemma about metrizable spaces.


Definition. Let $X$ be a topological space. A collection $\mathcal{A}$ of subsets of $X$ is said to be
locally finite in $X$ if every point of $X$ has a neighborhood that intersects only finitely
many elements of $\mathcal{A}$.

EXAMPLE 1. The collection of intervals

$$\mathcal{A} = \{(n, n + 2) \mid n \in \mathbb{Z}\}$$

is locally finite in the topological space $\mathbb{R}$, as you can check. On the other hand, the
collection

$$\mathcal{B} = \{(0, 1/n) \mid n \in \mathbb{Z}_+\}$$

is locally finite in $(0, 1)$ but not in $\mathbb{R}$, as is the collection

$$\mathcal{C} = \{(1/(n + 1), 1/n) \mid n \in \mathbb{Z}_+\}.$$
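
The claims of Example 1 can be explored numerically. The Python sketch below counts how many intervals of a finite truncation of each collection meet a small ball around a given point; the truncation size N, the helper names, and the use of exact rationals are assumptions made for illustration, and counting over a truncation only suggests (it does not prove) what happens for the infinite collections.

```python
from fractions import Fraction

def meets(interval, center, radius):
    """True if the open interval (a, b) meets the open ball (center - r, center + r)."""
    a, b = interval
    return max(a, center - radius) < min(b, center + radius)

def count_meeting(intervals, center, radius):
    """Number of intervals in the list that meet the ball about `center`."""
    return sum(1 for interval in intervals if meets(interval, center, radius))

# Finite truncations of the three collections in Example 1.
N = 1000
A = [(Fraction(n), Fraction(n + 2)) for n in range(-N, N)]
B = [(Fraction(0), Fraction(1, n)) for n in range(1, N + 1)]
C = [(Fraction(1, n + 1), Fraction(1, n)) for n in range(1, N + 1)]

if __name__ == "__main__":
    # A: a ball of radius 1/2 about any point meets only a few intervals.
    print(count_meeting(A, Fraction(5), Fraction(1, 2)))
    # B, C: at an interior point of (0, 1) the count stays small ...
    print(count_meeting(B, Fraction(1, 2), Fraction(1, 4)))
    # ... but about the point 0 of R the count grows with the truncation size.
    print(count_meeting(B, Fraction(0), Fraction(1, 10**6)))
    print(count_meeting(C, Fraction(0), Fraction(1, 10)))
```

Increasing N leaves the first two counts unchanged but increases the last two without bound, which matches the failure of local finiteness at the point 0 of R.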

Lemma 39.1. Let $\mathcal{A}$ be a locally finite collection of subsets of $X$. Then:
(a) Any subcollection of $\mathcal{A}$ is locally finite.
(b) The collection $\mathcal{B} = \{\bar{A}\}_{A \in \mathcal{A}}$ of the closures of the elements of $\mathcal{A}$ is locally finite.
(c) $\overline{\bigcup_{A \in \mathcal{A}} A} = \bigcup_{A \in \mathcal{A}} \bar{A}$.


Proof. Statement (a) is trivial. To prove (b), note that any open set $U$ that intersects
the set $\bar{A}$ necessarily intersects $A$. Therefore, if $U$ is a neighborhood of $x$ that intersects
only finitely many elements $A$ of $\mathcal{A}$, then $U$ can intersect at most the same number of
sets of the collection $\mathcal{B}$. (It might intersect fewer sets of $\mathcal{B}$, since $\bar{A}_1$ and $\bar{A}_2$ can be
equal even though $A_1$ and $A_2$ are not.)

To prove (c), let $Y$ denote the union of the elements of $\mathcal{A}$:

$$\bigcup_{A \in \mathcal{A}} A = Y.$$

In general, $\bigcup \bar{A} \subset \bar{Y}$; we prove the reverse inclusion, under the assumption of local
finiteness. Let $x \in \bar{Y}$; let $U$ be a neighborhood of $x$ that intersects only finitely many
elements of $\mathcal{A}$, say $A_1, \dots, A_k$. We assert that $x$ belongs to one of the sets $\bar{A}_1,
\dots, \bar{A}_k$, and hence belongs to $\bigcup \bar{A}$. For otherwise, the set $U - \bar{A}_1 - \cdots - \bar{A}_k$ would
be a neighborhood of $x$ that intersects no element of $\mathcal{A}$ and hence does not intersect $Y$,
contrary to the assumption that $x \in \bar{Y}$. ∎


There is an analogous concept of local finiteness for an indexed family of subsets
of $X$. The indexed family $\{A_\alpha\}_{\alpha \in J}$ is said to be a locally finite indexed family in $X$
if every $x \in X$ has a neighborhood that intersects $A_\alpha$ for only finitely many values
of $\alpha$. What is the relation between the two formulations of local finiteness? It is easy
to see that $\{A_\alpha\}_{\alpha \in J}$ is a locally finite indexed family if and only if it is locally finite
as a collection of sets and each nonempty subset $A$ of $X$ equals $A_\alpha$ for at most finitely
many values of $\alpha$.

We shall be concerned with locally finite indexed families only in §41, when we 
deal with partitions of unity. 


Definition. A collection $\mathcal{B}$ of subsets of $X$ is said to be countably locally finite if $\mathcal{B}$
can be written as the countable union of collections $\mathcal{B}_n$, each of which is locally finite.


Most authors use the term "$\sigma$-locally finite" for this concept. The $\sigma$ comes from
measure theory and stands for the phrase "countable union of." Note that both a countable
collection and a locally finite collection are countably locally finite.


Definition. Let $\mathcal{A}$ be a collection of subsets of the space $X$. A collection $\mathcal{B}$ of subsets
of $X$ is said to be a refinement of $\mathcal{A}$ (or is said to refine $\mathcal{A}$) if for each element $B$ of $\mathcal{B}$,
there is an element $A$ of $\mathcal{A}$ containing $B$. If the elements of $\mathcal{B}$ are open sets, we call $\mathcal{B}$
an open refinement of $\mathcal{A}$; if they are closed sets, we call $\mathcal{B}$ a closed refinement.
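
For finite collections of open intervals in R, the definition of refinement is a one-line check. The Python sketch below is a minimal illustration; the function names and the sample collections are hypothetical and chosen only to exercise the definition.

```python
def contained_in(b, a):
    """True if the open interval b = (b0, b1) lies inside the open interval a = (a0, a1)."""
    return a[0] <= b[0] and b[1] <= a[1]

def refines(collection_b, collection_a):
    """True if every element of collection_b lies in some element of collection_a."""
    return all(any(contained_in(b, a) for a in collection_a) for b in collection_b)

if __name__ == "__main__":
    A = [(0, 3), (2, 5)]
    # Each interval here sits inside (0, 3) or inside (2, 5).
    print(refines([(0, 1), (2.5, 4), (1, 3)], A))
    # (1, 4) sits inside neither element of A.
    print(refines([(1, 4)], A))
```

Note that the definition does not ask a refinement to cover the same set as the original collection; the first sample collection refines A although it misses points of (0, 5).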




Lemma 39.2. Let $X$ be a metrizable space. If $\mathcal{A}$ is an open covering of $X$, then there
is an open covering $\mathcal{E}$ of $X$ refining $\mathcal{A}$ that is countably locally finite.


Proof. We shall use the well-ordering theorem in proving this theorem. Choose a
well-ordering $<$ for the collection $\mathcal{A}$. Let us denote the elements of $\mathcal{A}$ generically by
the letters $U, V, W, \dots$.

Choose a metric for $X$. Let $n$ be a positive integer, fixed for the moment. Given an
element $U$ of $\mathcal{A}$, let us define $S_n(U)$ to be the subset of $U$ obtained by "shrinking" $U$
a distance of $1/n$. More precisely, let

$$S_n(U) = \{x \mid B(x, 1/n) \subset U\}.$$

(It happens that $S_n(U)$ is a closed set, but that is not important for our purposes.) Now
we use the well-ordering $<$ of $\mathcal{A}$ to pass to a still smaller set. For each $U$ in $\mathcal{A}$, define

$$T_n(U) = S_n(U) - \bigcup_{V < U} V.$$

The situation where $\mathcal{A}$ consists of the three sets $U < V < W$ is pictured in
Figure 39.1. Just as the figure suggests, the sets we have formed are disjoint.


Figure 39.1 


In fact, they are separated by a distance of at least $1/n$. This means that if $V$ and $W$
are distinct elements of $\mathcal{A}$, then $d(x, y) \ge 1/n$ whenever $x \in T_n(V)$ and $y \in T_n(W)$.

To prove this fact, assume the notation has been so chosen that $V < W$. Since $x$
is in $T_n(V)$, then $x$ is in $S_n(V)$, so the $1/n$-neighborhood of $x$ lies in $V$. On the other
hand, since $V < W$ and $y$ is in $T_n(W)$, the definition of the latter set tells us that $y$ is
not in $V$. It follows that $y$ is not in the $1/n$-neighborhood of $x$.

The sets $T_n(U)$ are not yet the ones we want, for we do not know that they are
open sets. (In fact, they are closed.) So let us expand each of them slightly to obtain
an open set $E_n(U)$. Specifically, let $E_n(U)$ be the $1/3n$-neighborhood of $T_n(U)$; that
is, let $E_n(U)$ be the union of the open balls $B(x, 1/3n)$, for $x \in T_n(U)$.

In the case $U < V < W$, we have the situation pictured in Figure 39.2. As the
figure suggests, the sets we have formed are disjoint. Indeed, if $V$ and $W$ are distinct
elements of $\mathcal{A}$, we assert that $d(x, y) \ge 1/3n$ whenever $x \in E_n(V)$ and $y \in E_n(W)$;
this fact follows at once from the triangle inequality. Note that for each $V \in \mathcal{A}$, the set
$E_n(V)$ is contained in $V$.


Figure 39.2


Now let us define

$$\mathcal{E}_n = \{E_n(U) \mid U \in \mathcal{A}\}.$$

We claim that $\mathcal{E}_n$ is a locally finite collection of open sets that refines $\mathcal{A}$. The fact
that $\mathcal{E}_n$ refines $\mathcal{A}$ comes from the fact that $E_n(V) \subset V$ for each $V \in \mathcal{A}$. The fact that
$\mathcal{E}_n$ is locally finite comes from the fact that for any $x$ in $X$, the $1/6n$-neighborhood of
$x$ can intersect at most one element of $\mathcal{E}_n$.

Of course, the collection $\mathcal{E}_n$ will not cover $X$. (Figure 39.2 illustrates that fact.)
But we assert that the collection

$$\mathcal{E} = \bigcup_{n \in \mathbb{Z}_+} \mathcal{E}_n$$

does cover $X$.

Let $x$ be a point of $X$. The collection $\mathcal{A}$ with which we began covers $X$; let us
choose $U$ to be the first element of $\mathcal{A}$ (in the well-ordering $<$) that contains $x$. Since $U$
is open, we can choose $n$ so that $B(x, 1/n) \subset U$. Then, by definition, $x \in S_n(U)$.
Now because $U$ is the first element of $\mathcal{A}$ that contains $x$, the point $x$ belongs to $T_n(U)$.
Then $x$ also belongs to the element $E_n(U)$ of $\mathcal{E}_n$, as desired. ∎
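
The construction in this proof can be carried out literally on a finite metric space, where every set is open and a list order serves as the well-ordering. The Python sketch below is an illustration under those assumptions: the grid X, the three-set covering COVER, and the function names S, T, E are hypothetical stand-ins for the objects $S_n(U)$, $T_n(U)$, $E_n(U)$ of the proof.

```python
from fractions import Fraction

# Hypothetical finite metric space: rationals 0, 1/10, ..., 4 with d(x, y) = |x - y|.
X = [Fraction(k, 10) for k in range(0, 41)]

# A covering of X by three sets U < V < W; list order plays the role of the well-ordering.
COVER = [
    {x for x in X if x < 2},
    {x for x in X if 1 < x < 3},
    {x for x in X if Fraction(5, 2) < x},
]

def d(x, y):
    return abs(x - y)

def ball(x, r):
    """Open ball B(x, r) in the finite space X."""
    return {y for y in X if d(x, y) < r}

def S(n, U):
    """S_n(U): points whose 1/n-ball lies inside U."""
    return {x for x in X if ball(x, Fraction(1, n)) <= U}

def T(n, i):
    """T_n(U) for U = COVER[i]: remove every earlier element of the covering."""
    earlier = set().union(*COVER[:i])
    return S(n, COVER[i]) - earlier

def E(n, i):
    """E_n(U): the 1/(3n)-neighborhood of T_n(U)."""
    r = Fraction(1, 3 * n)
    return set().union(*(ball(x, r) for x in T(n, i)))

if __name__ == "__main__":
    for n in (1, 2, 5):
        print(n, [sorted(map(float, E(n, i))) for i in range(len(COVER))])
```

In this toy setting one can confirm the three facts the proof relies on: the sets $T_n$ are $1/n$ apart, each $E_n(U)$ stays inside $U$, and the sets $E_n(U)$ taken over all $n$ cover the space.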
Exercises


1. Check the statements in Example 1. 


2. Find a point-finite open covering A of R that is not locally finite. (The collec- 
tion A is point-finite if each point of R lies in only finitely many elements of A.) 


3. Give an example of a collection of sets $\mathcal{A}$ that is not locally finite, such that the
collection $\mathcal{B} = \{\bar{A} \mid A \in \mathcal{A}\}$ is locally finite.


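4. Let $\mathcal{A}$ be the following collection of subsets of $\mathbb{R}$:

$$\mathcal{A} = \{(n, n + 2) \mid n \in \mathbb{Z}\}.$$

Which of the following collections refine $\mathcal{A}$?

$$\mathcal{B} = \{(x, x + 1) \mid x \in \mathbb{R}\},$$
$$\mathcal{C} = \{(n, n + 3) \mid n \in \mathbb{Z}\},$$
$$\mathcal{D} = \{(x, x + 3) \mid x \in \mathbb{R}\}.$$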


5. Show that if $X$ has a countable basis, a collection $\mathcal{A}$ of subsets of $X$ is countably
locally finite if and only if it is countable.


6. Consider $\mathbb{R}^\omega$ in the uniform topology. Given $n$, let $\mathcal{B}_n$ be the collection of all
subsets of $\mathbb{R}^\omega$ of the form $\prod A_i$, where $A_i = \mathbb{R}$ for $i \le n$ and $A_i$ equals either $\{0\}$
or $\{1\}$ otherwise. Show that the collection $\mathcal{B} = \bigcup \mathcal{B}_n$ is countably locally finite,
but neither countable nor locally finite.


§40 The Nagata-Smirnov Metrization Theorem 


Now we prove that regularity of $X$ and the existence of a countably locally finite basis
for $X$ are equivalent to metrizability of $X$.

The proof that these conditions imply metrizability follows very closely the second
proof we gave of the Urysohn metrization theorem. In that proof we constructed a map
of the space $X$ into $\mathbb{R}^\omega$ that was an imbedding relative to the uniform metric $\bar{\rho}$ on $\mathbb{R}^\omega$.
So let us review the major elements of that proof. The first step of the proof was
to prove that every regular space $X$ with a countable basis is normal. The second
step was to construct a countable collection $\{f_n\}$ of real-valued functions on $X$ that
separated points from closed sets. The third step was to use the functions $f_n$ to define
a map imbedding $X$ in the product space $\mathbb{R}^\omega$. And the fourth step was to show that if
$f_n(x) \le 1/n$ for all $x$, then this map actually imbeds $X$ in the metric space $(\mathbb{R}^\omega, \bar{\rho})$.

Each of these steps needs to be generalized in order to prove the general metrization
theorem. First, we show that a regular space $X$ with a basis that is countably
locally finite is normal. Second, we construct a certain collection of real-valued
functions $\{f_\alpha\}$ on $X$ that separates points from closed sets. Third, we use these functions
to imbed $X$ in the product space $\mathbb{R}^J$, for some $J$. And fourth, we show that if the
functions $f_\alpha$ are sufficiently small, this map actually imbeds $X$ in the metric space
$(\mathbb{R}^J, \bar{\rho})$.

Before we start, we need to recall a notion we have already introduced in the
exercises, that of a $G_\delta$ set.


Definition. A subset $A$ of a space $X$ is called a $G_\delta$ set in $X$ if it equals the intersection
of a countable collection of open subsets of $X$.


EXAMPLE 1. Each open subset of $X$ is a $G_\delta$ set, trivially. In a first-countable Hausdorff
space, each one-point set is a $G_\delta$ set. The one-point subset $\{\Omega\}$ of $\bar{S}_\Omega$ is not a $G_\delta$ set, as
you can check.


EXAMPLE 2. In a metric space $X$, each closed set is a $G_\delta$ set. Given $A \subset X$, let $U(A, \epsilon)$
denote the $\epsilon$-neighborhood of $A$. If $A$ is closed, you can check that

$$A = \bigcap_{n \in \mathbb{Z}_+} U(A, 1/n).$$

Lemma 40.1. Let $X$ be a regular space with a basis $\mathcal{B}$ that is countably locally finite.
Then $X$ is normal, and every closed set in $X$ is a $G_\delta$ set in $X$.


Proof. Step 1. Let $W$ be open in $X$. We show there is a countable collection $\{U_n\}$ of
open sets of $X$ such that

$$W = \bigcup U_n = \bigcup \bar{U}_n.$$

Since the basis $\mathcal{B}$ for $X$ is countably locally finite, we can write $\mathcal{B} = \bigcup \mathcal{B}_n$, where
each collection $\mathcal{B}_n$ is locally finite. Let $\mathcal{C}_n$ be the collection of those basis elements $B$
such that $B \in \mathcal{B}_n$ and $\bar{B} \subset W$. Then $\mathcal{C}_n$ is locally finite, being a subcollection of $\mathcal{B}_n$.
Define

$$U_n = \bigcup_{B \in \mathcal{C}_n} B.$$

Then $U_n$ is an open set, and by Lemma 39.1,

$$\bar{U}_n = \bigcup_{B \in \mathcal{C}_n} \bar{B}.$$

Therefore, $\bar{U}_n \subset W$, so that

$$\bigcup U_n \subset \bigcup \bar{U}_n \subset W.$$

We assert that equality holds. Given $x \in W$, there is by regularity a basis element
$B \in \mathcal{B}$ such that $x \in B$ and $\bar{B} \subset W$. Now $B \in \mathcal{B}_n$ for some $n$. Then $B \in \mathcal{C}_n$ by
definition, so that $x \in U_n$. Thus $W \subset \bigcup U_n$, as desired.


Step 2. We show that every closed set $C$ in $X$ is a $G_\delta$ set in $X$. Given $C$, let
$W = X - C$. By Step 1, there are sets $U_n$ in $X$ such that $W = \bigcup \bar{U}_n$. Then
\[ C = \bigcap (X - \bar{U}_n), \]
so that $C$ equals a countable intersection of open sets of $X$.

Step 3. We show $X$ is normal. Let $C$ and $D$ be disjoint closed sets in $X$. Applying
Step 1 to the open set $X - D$, we construct a countable collection $\{U_n\}$ of open sets
such that $\bigcup U_n = \bigcup \bar{U}_n = X - D$. Then $\{U_n\}$ covers $C$ and each set $\bar{U}_n$ is disjoint
from $D$. Similarly, there is a countable covering $\{V_n\}$ of $D$ by open sets whose closures
are disjoint from $C$.

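Now we are back in the situation that arose in the proof that a regular space with a
countable basis is normal (Theorem 32.1). We can repeat that proof verbatim. Define
\[ U_n' = U_n - \bigcup_{i=1}^{n} \bar{V}_i \quad \text{and} \quad V_n' = V_n - \bigcup_{i=1}^{n} \bar{U}_i. \]
Then the sets
\[ U' = \bigcup_{n \in \mathbb{Z}_+} U_n' \quad \text{and} \quad V' = \bigcup_{n \in \mathbb{Z}_+} V_n' \]
are disjoint open sets about $C$ and $D$, respectively. ∎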
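
The verification borrowed from Theorem 32.1 can be written out briefly. The following is only a sketch in the notation of Step 3; the indices $j$ and $k$ are introduced here for illustration, and the display assumes the amsmath package.

```latex
% Openness: each U'_n is U_n minus a finite union of closed sets, hence open;
% the same holds for V'_n.
%
% Disjointness: suppose x \in U'_j \cap V'_k with j \le k
% (the case k \le j is symmetric, with the roles of U and V exchanged).
\begin{align*}
x \in U'_j &\implies x \in U_j \subset \bar{U}_j, \\
x \in V'_k = V_k - \bigcup_{i=1}^{k} \bar{U}_i &\implies x \notin \bar{U}_j
  \quad (\text{because } j \le k),
\end{align*}
% a contradiction, so U' \cap V' = \varnothing.
%
% Covering: if x \in C, then x \in U_n for some n, and x \notin \bar{V}_i for every i
% because each \bar{V}_i is disjoint from C. Hence x \in U'_n \subset U'.
% The argument for D \subset V' is the same.
```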


Lemma 40.2. Let $X$ be normal; let $A$ be a closed $G_\delta$ set in $X$. Then there is a
continuous function $f : X \to [0, 1]$ such that $f(x) = 0$ for $x \in A$ and $f(x) > 0$ for
$x \notin A$.


Proof. We gave this as an exercise in §33; we provide a proof here. Write $A$ as the
intersection of the open sets $U_n$, for $n \in \mathbb{Z}_+$. For each $n$, choose a continuous function
$f_n : X \to [0, 1]$ such that $f_n(x) = 0$ for $x \in A$ and $f_n(x) = 1$ for $x \in X - U_n$. Define
$f(x) = \sum f_n(x)/2^n$. The series converges uniformly, by comparison with $\sum 1/2^n$,
so that $f$ is continuous. Also, $f$ vanishes on $A$ and is positive on $X - A$. ∎
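
The two claims that close this proof can be checked by a short estimate. The following is a sketch in the notation of the proof; the cutoff $N$ and the index $m$ are introduced here only for illustration, and the display assumes the amsmath package.

```latex
% Uniform convergence: the tail of the series is bounded independently of x,
% because 0 \le f_n(x) \le 1 for every n and every x.
\begin{align*}
\Bigl| f(x) - \sum_{n=1}^{N} \frac{f_n(x)}{2^n} \Bigr|
  = \sum_{n=N+1}^{\infty} \frac{f_n(x)}{2^n}
  \le \sum_{n=N+1}^{\infty} \frac{1}{2^n}
  = \frac{1}{2^N}.
\end{align*}
% The same comparison with N = 0 shows 0 \le f(x) \le 1, so f maps X into [0, 1].
%
% Positivity off A: if x \notin A, then x \notin U_m for some m, so f_m(x) = 1 and
\begin{align*}
f(x) \ge \frac{f_m(x)}{2^m} = \frac{1}{2^m} > 0.
\end{align*}
```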


Theorem 40.3 (Nagata-Smirnov metrization theorem). A space $X$ is metrizable
if and only if $X$ is regular and has a basis that is countably locally finite.


Proof. Step 1. Assume $X$ is regular with a countably locally finite basis $\mathcal{B}$. Then
$X$ is normal, and every closed set in $X$ is a $G_\delta$ set in $X$. We shall show that $X$ is
metrizable by imbedding $X$ in the metric space $(\mathbb{R}^J, \bar{\rho})$ for some $J$.

Let $\mathcal{B} = \bigcup \mathcal{B}_n$, where each collection $\mathcal{B}_n$ is locally finite. For each positive
integer $n$, and each basis element $B \in \mathcal{B}_n$, choose a continuous function
\[ f_{n,B} : X \to [0, 1/n] \]
such that $f_{n,B}(x) > 0$ for $x \in B$ and $f_{n,B}(x) = 0$ for $x \notin B$. The collection $\{f_{n,B}\}$
separates points from closed sets in $X$: Given a point $x_0$ and a neighborhood $U$ of $x_0$,
there is a basis element $B$ such that $x_0 \in B \subset U$. Then $B \in \mathcal{B}_n$ for some $n$, so that
$f_{n,B}(x_0) > 0$ and $f_{n,B}$ vanishes outside $U$.

Let $J$ be the subset of $\mathbb{Z}_+ \times \mathcal{B}$ consisting of all pairs $(n, B)$ such that $B$ is an
element of $\mathcal{B}_n$. Define
\[ F : X \to [0, 1]^J \]
by the equation
\[ F(x) = (f_{n,B}(x))_{(n,B) \in J}. \]
Relative to the product topology on $[0, 1]^J$, the map $F$ is an imbedding, by
Theorem 34.2.

Now we give $[0, 1]^J$ the topology induced by the uniform metric and show that
$F$ is an imbedding relative to this topology as well. Here is where the condition
$f_{n,B}(x) \le 1/n$ comes in. The uniform topology is finer (larger) than the product
topology. Therefore, relative to the uniform metric, the map $F$ is injective and carries
open sets of $X$ onto open sets of the image space $Z = F(X)$. We must give a separate
proof that $F$ is continuous.

Note that on the subspace $[0, 1]^J$ of $\mathbb{R}^J$, the uniform metric equals the metric
\[ \rho((x_\alpha), (y_\alpha)) = \sup\{|x_\alpha - y_\alpha|\}. \]
To prove continuity, we take a point $x_0$ of $X$ and a number $\epsilon > 0$, and find a
neighborhood $W$ of $x_0$ such that
\[ x \in W \implies \rho(F(x), F(x_0)) < \epsilon. \]


Let $n$ be fixed for the moment. Choose a neighborhood $U_n$ of $x_0$ that intersects
only finitely many elements of the collection $\mathcal{B}_n$. This means that as $B$ ranges over $\mathcal{B}_n$,
all but finitely many of the functions $f_{n,B}$ are identically equal to zero on $U_n$. Because
each function $f_{n,B}$ is continuous, we can now choose a neighborhood $V_n$ of $x_0$
contained in $U_n$ on which each of the remaining functions $f_{n,B}$, for $B \in \mathcal{B}_n$, varies by at
most $\epsilon/2$.

Choose such a neighborhood $V_n$ of $x_0$ for each $n \in \mathbb{Z}_+$. Then choose $N$ so that
$1/N \le \epsilon/2$, and define $W = V_1 \cap \cdots \cap V_N$. We assert that $W$ is the desired
neighborhood of $x_0$. Let $x \in W$. If $n \le N$, then
\[ |f_{n,B}(x) - f_{n,B}(x_0)| \le \epsilon/2 \]
because the function $f_{n,B}$ either vanishes identically or varies by at most $\epsilon/2$ on $W$. If
$n > N$, then
\[ |f_{n,B}(x) - f_{n,B}(x_0)| \le 1/n < \epsilon/2 \]
because $f_{n,B}$ maps $X$ into $[0, 1/n]$. Therefore,
\[ \rho(F(x), F(x_0)) \le \epsilon/2 < \epsilon, \]
as desired.

Step 2. Now we prove the converse. Assume X is metrizable. We know X is 
regular; let us show that X has a basis that is countably locally finite. 

Choose a metric for $X$. Given $m$, let $\mathcal{A}_m$ be the covering of $X$ by all open balls
of radius $1/m$. By Lemma 39.2, there is an open covering $\mathcal{B}_m$ of $X$ refining $\mathcal{A}_m$ such
that $\mathcal{B}_m$ is countably locally finite. Note that each element of $\mathcal{B}_m$ has diameter at
most $2/m$. Let $\mathcal{B}$ be the union of the collections $\mathcal{B}_m$, for $m \in \mathbb{Z}_+$. Because each
collection $\mathcal{B}_m$ is countably locally finite, so is $\mathcal{B}$. We show that $\mathcal{B}$ is a basis for $X$.
Given $x \in X$ and given $\epsilon > 0$, we show that there is an element $B$ of $\mathcal{B}$ containing
$x$ that is contained in $B(x, \epsilon)$. First choose $m$ so that $1/m < \epsilon/2$. Then, because
$\mathcal{B}_m$ covers $X$, we can choose an element $B$ of $\mathcal{B}_m$ that contains $x$. Since $B$ contains $x$
and has diameter at most $2/m < \epsilon$, it is contained in $B(x, \epsilon)$, as desired. ∎
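
The diameter bookkeeping in Step 2 can be spelled out as follows. This is a sketch only: $d$ denotes the chosen metric, and the center $c$ and the points $y$, $z$ are names introduced here for illustration; the display assumes the amsmath package.

```latex
% Any open ball of radius 1/m about a center c has diameter at most 2/m:
\begin{align*}
y, z \in B(c, 1/m) \implies d(y, z) \le d(y, c) + d(c, z) < \frac{2}{m}.
\end{align*}
% Each element B of \mathcal{B}_m lies in such a ball, so \operatorname{diam} B \le 2/m.
% If x \in B and 1/m < \epsilon/2, then for every y \in B,
\begin{align*}
d(x, y) \le \operatorname{diam} B \le \frac{2}{m} < \epsilon,
\end{align*}
% which says exactly that B \subset B(x, \epsilon).
```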


Exercises 


1. Check the details of Examples 1 and 2.


2. A subset $W$ of $X$ is said to be an “$F_\sigma$ set” in $X$ if $W$ equals a countable union of
closed sets of $X$. Show that $W$ is an $F_\sigma$ set in $X$ if and only if $X - W$ is a $G_\delta$ set
in $X$.
[The terminology comes from the French. The “F” stands for “fermé,” which
means “closed,” and the “σ” for “somme,” which means “union.”]


3. Many spaces have countable bases; but no $T_1$ space has a locally finite basis
unless it is discrete. Prove this fact.


4. Find a nondiscrete space that has a countably locally finite basis but does not 
have a countable basis. 


5. A collection A of subsets of X is said to be locally discrete if each point of X 
has a neighborhood that intersects at most one element of A. A collection B is 
countably locally discrete (or “σ-locally discrete”) if it equals a countable union
of locally discrete collections. Prove the following: 

Theorem (Bing metrization theorem). A space X is metrizable if and only if it 
is regular and has a basis that is countably locally discrete. 


§41 Paracompactness 


The concept of paracompactness is one of the most useful generalizations of compact- 
ness that has been discovered in recent years. It is particularly useful for applications 
in topology and differential geometry. We shall give just one application, a metrization
theorem that we prove in the next section. 

Many of the spaces that are familiar to us already are paracompact. For instance, 
every compact space is paracompact; this will be an immediate consequence of the
definition. It is also true that every metrizable space is paracompact; this is a theorem 
due to A. H. Stone, which we shall prove. Thus the class of paracompact spaces 
includes the two most important classes of spaces we have studied. It includes many 
other spaces as well. 


To see how paracompactness generalizes compactness, we recall the definition of 
compactness: A space X is said to be compact if every open covering A of X contains 
a finite subcollection that covers X. An equivalent way of saying this is the following: 


A space $X$ is compact if every open covering $\mathcal{A}$ of $X$ has a finite open
refinement $\mathcal{B}$ that covers $X$.


This definition is equivalent to the usual one; given such a refinement B, one can 
choose for each element of B an element of A containing it; in this way one obtains a 
finite subcollection of A that covers X. 

This new formulation of compactness is an awkward one, but it suggests a way to 
generalize:


Definition. A space X is paracompact if every open covering A of X has a locally 
finite open refinement B that covers X. 


Many authors, following the lead of Bourbaki, include as part of the definition of 
the term paracompact the requirement that the space be Hausdorff. (Bourbaki also 
includes the Hausdorff condition as part of the definition of the term compact.) We 
shall not follow this convention. 


EXAMPLE 1. The space $\mathbb{R}^n$ is paracompact. Let $X = \mathbb{R}^n$. Let $\mathcal{A}$ be an open covering
of $X$. Let $B_0 = \varnothing$, and for each positive integer $m$, let $B_m$ denote the open ball of radius $m$
centered at the origin. Given $m$, choose finitely many elements of $\mathcal{A}$ that cover $\bar{B}_m$ and
intersect each one with the open set $X - \bar{B}_{m-1}$; let this finite collection of open sets be
denoted $\mathcal{C}_m$. Then the collection $\mathcal{C} = \bigcup \mathcal{C}_m$ is a refinement of $\mathcal{A}$. It is clearly locally finite,
for the open set $B_m$ intersects only finitely many elements of $\mathcal{C}$, namely those elements
belonging to the collection $\mathcal{C}_1 \cup \cdots \cup \mathcal{C}_m$. Finally, $\mathcal{C}$ covers $X$. For, given $x$, let $m$ be the
smallest integer such that $x \in \bar{B}_m$. Then $x$ belongs to an element of $\mathcal{C}_m$, by definition.


Some of the properties of a paracompact space are similar to those of a compact 
space. For instance, a subspace of a paracompact space is not necessarily paracompact, 
but a closed subspace is paracompact. Also, a paracompact Hausdorff space is normal. 
In other ways, a paracompact space is not similar to a compact space; in particular, the 
product of two paracompact spaces need not be paracompact. We shall verify these 
facts shortly. 


Theorem 41.1. Every paracompact Hausdorff space $X$ is normal.


Proof. The proof is somewhat similar to the proof that a compact Hausdorff space is 
normal. 

First one proves regularity. Let $a$ be a point of $X$ and let $B$ be a closed set of $X$
disjoint from $a$. The Hausdorff condition enables us to choose, for each $b$ in $B$, an open
set $U_b$ about $b$ whose closure is disjoint from $a$. Cover $X$ by the open sets $U_b$, along
with the open set $X - B$; take a locally finite open refinement $\mathcal{C}$ that covers $X$. Form
the subcollection $\mathcal{D}$ of $\mathcal{C}$ consisting of every element of $\mathcal{C}$ that intersects $B$. Then $\mathcal{D}$
covers $B$. Furthermore, if $D \in \mathcal{D}$, then $\bar{D}$ is disjoint from $a$. For $D$ intersects $B$, so it
lies in some set $U_b$, whose closure is disjoint from $a$. Let
\[ V = \bigcup_{D \in \mathcal{D}} D; \]
then $V$ is an open set in $X$ containing $B$. Because $\mathcal{D}$ is locally finite,
\[ \bar{V} = \bigcup_{D \in \mathcal{D}} \bar{D}, \]
so that $\bar{V}$ is disjoint from $a$. Thus regularity is proved.

To prove normality, one merely repeats the same argument, replacing $a$ by the
closed set $A$ throughout and replacing the Hausdorff condition by regularity. ∎


Theorem 41.2. Every closed subspace of a paracompact space is paracompact. 


Proof. Let $Y$ be a closed subspace of the paracompact space $X$; let $\mathcal{A}$ be a covering
of $Y$ by sets open in $Y$. For each $A \in \mathcal{A}$, choose an open set $A'$ of $X$ such that
$A' \cap Y = A$. Cover $X$ by the open sets $A'$, along with the open set $X - Y$. Let $\mathcal{B}$ be a
locally finite open refinement of this covering that covers $X$. The collection
\[ \mathcal{C} = \{B \cap Y \mid B \in \mathcal{B}\} \]
is the required locally finite open refinement of $\mathcal{A}$. ∎


EXAMPLE 2. A paracompact subspace of a Hausdorff space X need not be closed in X. 
Indeed, the open interval (0, 1) is paracompact, being homeomorphic to R, but it is not 
closed in $\mathbb{R}$.


EXAMPLE 3. A subspace of a paracompact space need not be paracompact. The space
$\bar{S}_\Omega \times \bar{S}_\Omega$ is compact and, therefore, paracompact. But the subspace $S_\Omega \times \bar{S}_\Omega$ is not
paracompact, for it is Hausdorff but not normal.

To prove the important theorem that every metrizable space is paracompact, we 
need the following lemma, due to E. Michael, which is also useful for other purposes: 


Lemma 41.3. Let X be regular. Then the following conditions on X are equivalent: 
Every open covering of X has a refinement that is: 
(1) An open covering of X and countably locally finite. 
(2) A covering of X and locally finite.
(3) A closed covering of X and locally finite. 
(4) An open covering of X and locally finite. 


Proof. It is trivial that (4) $\Rightarrow$ (1). What we need to prove our theorem is the converse.
In order to prove the converse, we must go through the steps (1) $\Rightarrow$ (2) $\Rightarrow$ (3) $\Rightarrow$ (4)
anyway, so we have for convenience listed these conditions in the statement of the
lemma.

(1) $\Rightarrow$ (2). Let $\mathcal{A}$ be an open covering of $X$. Let $\mathcal{B}$ be an open refinement of $\mathcal{A}$
that covers $X$ and is countably locally finite; let
\[ \mathcal{B} = \bigcup \mathcal{B}_n, \]
where each $\mathcal{B}_n$ is locally finite.

Now we apply essentially the same sort of shrinking trick we have used before to
make sets from different $\mathcal{B}_n$'s disjoint. Given $i$, let
\[ V_i = \bigcup_{U \in \mathcal{B}_i} U. \]
Then for each $n \in \mathbb{Z}_+$ and each element $U$ of $\mathcal{B}_n$, define
\[ S_n(U) = U - \bigcup_{i < n} V_i. \]
[Note that $S_n(U)$ is not necessarily open, nor closed.] Let
\[ \mathcal{C}_n = \{S_n(U) \mid U \in \mathcal{B}_n\}. \]


Then $\mathcal{C}_n$ is a refinement of $\mathcal{B}_n$, because $S_n(U) \subset U$ for each $U \in \mathcal{B}_n$.

Let $\mathcal{C} = \bigcup \mathcal{C}_n$. We assert that $\mathcal{C}$ is the required locally finite refinement of $\mathcal{A}$,
covering $X$.

Let $x$ be a point of $X$. We wish to prove that $x$ lies in an element of $\mathcal{C}$, and
that $x$ has a neighborhood intersecting only finitely many elements of $\mathcal{C}$. Consider the
covering $\mathcal{B} = \bigcup \mathcal{B}_n$; let $N$ be the smallest integer such that $x$ lies in an element of $\mathcal{B}_N$.
Let $U$ be an element of $\mathcal{B}_N$ containing $x$. First, note that since $x$ lies in no element of
$\mathcal{B}_i$ for $i < N$, the point $x$ lies in the element $S_N(U)$ of $\mathcal{C}$. Second, note that since each
collection $\mathcal{B}_n$ is locally finite, we can choose for each $n = 1, \ldots, N$ a neighborhood
$W_n$ of $x$ that intersects only finitely many elements of $\mathcal{B}_n$. Now if $W_n$ intersects
the element $S_n(V)$ of $\mathcal{C}_n$, it must intersect the element $V$ of $\mathcal{B}_n$, since $S_n(V) \subset V$.
Therefore, $W_n$ intersects only finitely many elements of $\mathcal{C}_n$. Furthermore, because $U$
is in $\mathcal{B}_N$, $U$ intersects no element of $\mathcal{C}_n$ for $n > N$. As a result, the neighborhood
\[ W_1 \cap W_2 \cap \cdots \cap W_N \cap U \]
of $x$ intersects only finitely many elements of $\mathcal{C}$.

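(2) $\Rightarrow$ (3). Let $\mathcal{A}$ be an open covering of $X$. Let $\mathcal{B}$ be the collection of all open
sets $U$ of $X$ such that $\bar{U}$ is contained in an element of $\mathcal{A}$. By regularity, $\mathcal{B}$ covers $X$.
Using (2), we can find a refinement $\mathcal{C}$ of $\mathcal{B}$ that covers $X$ and is locally finite. Let
\[ \mathcal{D} = \{\bar{C} \mid C \in \mathcal{C}\}. \]
Then $\mathcal{D}$ also covers $X$; it is locally finite by Lemma 39.1; and it refines $\mathcal{A}$.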

(3) $\Rightarrow$ (4). Let $\mathcal{A}$ be an open covering of $X$. Using (3), choose $\mathcal{B}$ to be a refinement
of $\mathcal{A}$ that covers $X$ and is locally finite. (We can take $\mathcal{B}$ to be a closed refinement
if we like, but that is irrelevant.) We seek to expand each element $B$ of $\mathcal{B}$ slightly to
an open set, making the expansion slight enough that the resulting collection of open
sets will still be locally finite and will still refine $\mathcal{A}$.

This step involves a new trick. The previous trick, used several times, consisted of
ordering the sets in some way and forming a new set by subtracting off all the previous
ones. That trick shrinks the sets; to expand them we need something different. We
shall introduce an auxiliary locally finite closed covering $\mathcal{C}$ of $X$ and use it to expand
the elements of $\mathcal{B}$.

For each point $x$ of $X$, there is a neighborhood of $x$ that intersects only finitely
many elements of $\mathcal{B}$. The collection of all open sets that intersect only finitely many
elements of $\mathcal{B}$ is thus an open covering of $X$. Using (3) again, let $\mathcal{C}$ be a closed
refinement of this covering that covers $X$ and is locally finite. Each element of $\mathcal{C}$
intersects only finitely many elements of $\mathcal{B}$.

For each element $B$ of $\mathcal{B}$, let
\[ \mathcal{C}(B) = \{C \mid C \in \mathcal{C} \text{ and } C \subset X - B\}. \]
Then define
\[ E(B) = X - \bigcup_{C \in \mathcal{C}(B)} C. \]
Because $\mathcal{C}$ is a locally finite collection of closed sets, the union of the elements of any
subcollection of $\mathcal{C}$ is closed, by Lemma 39.1. Therefore, the set $E(B)$ is an open set.
Furthermore, $E(B) \supset B$ by definition. (See Figure 41.1, in which the elements of $\mathcal{B}$
are represented as closed circular regions and line segments, and the elements of $\mathcal{C}$ are
represented as closed square regions.)

Figure 41.1

Now we may have expanded each $B$ too much; the collection $\{E(B)\}$ may not be
a refinement of $\mathcal{A}$. This is easily remedied. For each $B \in \mathcal{B}$, choose an element $F(B)$
of $\mathcal{A}$ containing $B$. Then define
\[ \mathcal{D} = \{E(B) \cap F(B) \mid B \in \mathcal{B}\}. \]


The collection $\mathcal{D}$ is a refinement of $\mathcal{A}$. Because $B \subset (E(B) \cap F(B))$ and $\mathcal{B}$ covers $X$,
the collection $\mathcal{D}$ also covers $X$.

We have finally to prove that $\mathcal{D}$ is locally finite. Given a point $x$ of $X$, choose a
neighborhood $W$ of $x$ that intersects only finitely many elements of $\mathcal{C}$, say $C_1, \ldots, C_k$.
We show that $W$ intersects only finitely many elements of $\mathcal{D}$. Because $\mathcal{C}$ covers $X$,
the set $W$ is covered by $C_1, \ldots, C_k$. Thus, it suffices to show that each element $C$ of $\mathcal{C}$
intersects only finitely many elements of $\mathcal{D}$. Now if $C$ intersects the set $E(B) \cap F(B)$,
then it intersects $E(B)$, so by definition of $E(B)$ it is not contained in $X - B$; hence $C$
must intersect $B$. Since $C$ intersects only finitely many elements of $\mathcal{B}$, it can intersect
at most the same number of elements of the collection $\mathcal{D}$. ∎


Theorem 41.4. Every metrizable space is paracompact.


Proof. Let $X$ be a metrizable space. We already know from Lemma 39.2 that, given
an open covering $\mathcal{A}$ of $X$, it has an open refinement that covers $X$ and is countably
locally finite. The preceding lemma then implies that $\mathcal{A}$ has an open refinement that
covers $X$ and is locally finite. ∎


Theorem 41.5. Every regular Lindelöf space is paracompact.


Proof. Let $X$ be regular and Lindelöf. Given an open covering $\mathcal{A}$ of $X$, it has a
countable subcollection that covers $X$; this subcollection is automatically countably
locally finite. The preceding lemma applies to show $\mathcal{A}$ has an open refinement that
covers $X$ and is locally finite. ∎


EXAMPLE 4. The product of two paracompact spaces need not be paracompact. The
space $\mathbb{R}_\ell$ is paracompact, for it is regular and Lindelöf. However, $\mathbb{R}_\ell \times \mathbb{R}_\ell$ is not
paracompact, for it is Hausdorff but not normal.


EXAMPLE 5. The space $\mathbb{R}^\omega$ is paracompact in both the product and uniform topologies.
This result follows from the fact that $\mathbb{R}^\omega$ is metrizable in these topologies. It is not known
whether $\mathbb{R}^\omega$ is paracompact in the box topology. (See the comment in Exercise 5 of §32.)


EXAMPLE 6. The product space $\mathbb{R}^J$ is not paracompact if $J$ is uncountable. For $\mathbb{R}^J$ is
Hausdorff but not normal.


One of the most useful properties that a paracompact space X possesses has to do 
with the existence of partitions of unity on X. We have already seen the finite version 
of this notion in §36; we discuss the general case now. Recall that if $\phi : X \to \mathbb{R}$, the
support of $\phi$ is the closure of the set of those $x$ for which $\phi(x) \ne 0$.


Definition. Let $\{U_\alpha\}_{\alpha \in J}$ be an indexed open covering of $X$. An indexed family of
continuous functions
\[ \phi_\alpha : X \to [0, 1] \]
is said to be a partition of unity on $X$, dominated by $\{U_\alpha\}$, if:
(1) $(\text{Support } \phi_\alpha) \subset U_\alpha$ for each $\alpha$.
(2) The indexed family $\{\text{Support } \phi_\alpha\}$ is locally finite.
(3) $\sum \phi_\alpha(x) = 1$ for each $x$.


Condition (2) implies that each $x \in X$ has a neighborhood on which the function
$\phi_\alpha$ vanishes identically for all but finitely many values of $\alpha$. Thus we can make
sense of the “sum” indicated in (3); we interpret it to mean the sum of the terms $\phi_\alpha(x)$
that do not equal zero.
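
For a concrete instance of the definition, consider the following sketch on $X = \mathbb{R}$. The covering $\{U_k\}$ and the "tent" functions $\phi_k$ are illustrative choices made here, not taken from the text; the display assumes the amsmath package.

```latex
% Index set J = \mathbb{Z}; open covering U_k = (k - 2, k + 2) of \mathbb{R}.
\begin{align*}
\phi_k(x) &= \max\{0,\; 1 - |x - k|\}, \qquad k \in \mathbb{Z}.
\end{align*}
% (1) Support \phi_k = [k - 1, k + 1] \subset U_k.
% (2) The interval (x - 1/2, x + 1/2) meets [k - 1, k + 1] only when |x - k| < 3/2,
%     which holds for at most three integers k, so the supports are locally finite.
% (3) If k \le x \le k + 1, only \phi_k and \phi_{k+1} can be nonzero at x, and
\begin{align*}
\phi_k(x) + \phi_{k+1}(x) &= \bigl(1 - (x - k)\bigr) + \bigl(1 - (k + 1 - x)\bigr) = 1.
\end{align*}
```

Every real number lies in some interval $[k, k+1]$, so condition (3) holds at every point of $\mathbb{R}$.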

We now construct a partition of unity on an arbitrary paracompact Hausdorff 
space. We begin by proving a “shrinking lemma,” just as we did for the finite case 
in §36. 


*Lemma 41.6. Let $X$ be a paracompact Hausdorff space; let $\{U_\alpha\}_{\alpha \in J}$ be an
indexed family of open sets covering $X$. Then there exists a locally finite indexed family
$\{V_\alpha\}_{\alpha \in J}$ of open sets covering $X$ such that $\bar{V}_\alpha \subset U_\alpha$ for each $\alpha$.


The condition that $\bar{V}_\alpha \subset U_\alpha$ for each $\alpha$ is sometimes expressed by saying that the
family $\{\bar{V}_\alpha\}$ is a precise refinement of the family $\{U_\alpha\}$.


Proof. Let $\mathcal{A}$ be the collection of all open sets $A$ such that $\bar{A}$ is contained in some
element of the collection $\{U_\alpha\}$. Regularity of $X$ implies that $\mathcal{A}$ covers $X$. Since $X$
is paracompact, we can find a locally finite collection $\mathcal{B}$ of open sets covering $X$ that
refines $\mathcal{A}$. Let us index $\mathcal{B}$ bijectively with some index set $K$; then the general element
of $\mathcal{B}$ can be denoted $B_\beta$, for $\beta \in K$, and $\{B_\beta\}_{\beta \in K}$ is a locally finite indexed family.
Since $\mathcal{B}$ refines $\mathcal{A}$, we can define a function $f : K \to J$ by choosing, for each $\beta$ in $K$,
an element $f(\beta) \in J$ such that
\[ \bar{B}_\beta \subset U_{f(\beta)}. \]
Then for each $\alpha \in J$, we define $V_\alpha$ to be the union of the elements of the collection
\[ \mathcal{B}_\alpha = \{B_\beta \mid f(\beta) = \alpha\}. \]
(Note that $V_\alpha$ is empty if there exists no index $\beta$ such that $f(\beta) = \alpha$.) For each
element $B_\beta$ of the collection $\mathcal{B}_\alpha$ we have $\bar{B}_\beta \subset U_\alpha$ by definition. Because the
collection $\mathcal{B}_\alpha$ is locally finite, $\bar{V}_\alpha$ equals the union of the closures of the elements of $\mathcal{B}_\alpha$, so
that $\bar{V}_\alpha \subset U_\alpha$.

Finally, we check local finiteness. Given $x \in X$, choose a neighborhood $W$ of $x$
such that $W$ intersects $B_\beta$ for only finitely many values of $\beta$, say $\beta = \beta_1, \ldots, \beta_K$.
Then $W$ can intersect $V_\alpha$ only if $\alpha$ is one of the indices $f(\beta_1), \ldots, f(\beta_K)$. ∎


*Theorem 41.7. Let $X$ be a paracompact Hausdorff space; let $\{U_\alpha\}_{\alpha \in J}$ be an indexed
open covering of $X$. Then there exists a partition of unity on $X$ dominated by $\{U_\alpha\}$.


Proof. We begin by applying the shrinking lemma twice, to find locally finite indexed
families of open sets $\{W_\alpha\}$ and $\{V_\alpha\}$ covering $X$, such that $\bar{W}_\alpha \subset V_\alpha$ and $\bar{V}_\alpha \subset U_\alpha$
for each $\alpha$. Since $X$ is normal, we may choose, for each $\alpha$, a continuous function
$\psi_\alpha : X \to [0, 1]$ such that $\psi_\alpha(\bar{W}_\alpha) = \{1\}$ and $\psi_\alpha(X - V_\alpha) = \{0\}$. Since $\psi_\alpha$ is
nonzero only at points of $V_\alpha$, we have
\[ (\text{Support } \psi_\alpha) \subset \bar{V}_\alpha \subset U_\alpha. \]
Furthermore, the indexed family $\{\bar{V}_\alpha\}$ is locally finite (since an open set intersects $\bar{V}_\alpha$
only if it intersects $V_\alpha$); hence the indexed family $\{\text{Support } \psi_\alpha\}$ is also locally finite.
Note that because $\{\bar{W}_\alpha\}$ covers $X$, for any given $x$ at least one of the functions $\psi_\alpha$ is
positive at $x$.

We can now make sense of the formally infinite sum
\[ \Psi(x) = \sum_\alpha \psi_\alpha(x). \]
Since each $x \in X$ has a neighborhood $W_x$ that intersects the set $(\text{Support } \psi_\alpha)$ for
only finitely many values of $\alpha$, we can interpret this infinite sum to mean the sum of
its (finitely many) nonzero terms. It follows that the restriction of $\Psi$ to $W_x$ equals a
finite sum of continuous functions, and is thus continuous. Then since $\Psi$ is continuous
on $W_x$ for each $x$, it is continuous on $X$. It is also positive. We now define
\[ \phi_\alpha(x) = \psi_\alpha(x) / \Psi(x) \]
to obtain our desired partition of unity. ∎
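
That the normalized functions satisfy the three conditions of the definition can be checked directly. The following is a sketch in the notation of the proof, assuming the amsmath package.

```latex
% Normalization: at each x the sums below have finitely many nonzero terms.
\begin{align*}
\sum_{\alpha} \phi_\alpha(x)
  = \sum_{\alpha} \frac{\psi_\alpha(x)}{\Psi(x)}
  = \frac{1}{\Psi(x)} \sum_{\alpha} \psi_\alpha(x)
  = \frac{\Psi(x)}{\Psi(x)} = 1.
\end{align*}
% Range: 0 \le \psi_\alpha(x) \le \Psi(x), so each \phi_\alpha maps X into [0, 1].
% Supports: \Psi > 0 everywhere, so \phi_\alpha(x) \ne 0 exactly when \psi_\alpha(x) \ne 0; hence
\begin{align*}
\operatorname{Support} \phi_\alpha = \operatorname{Support} \psi_\alpha
  \subset \bar{V}_\alpha \subset U_\alpha ,
\end{align*}
% and the family \{\operatorname{Support} \phi_\alpha\} is locally finite
% because \{\operatorname{Support} \psi_\alpha\} is.
```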


Partitions of unity are most often used in mathematics to “patch together” func- 
tions that are defined locally so as to obtain a function that is defined globally. Their 
use in §36 illustrates this process. Here is another such illustration: 


*Theorem 41.8. Let $X$ be a paracompact Hausdorff space; let $\mathcal{C}$ be a collection of
subsets of $X$; for each $C \in \mathcal{C}$, let $\epsilon_C$ be a positive number. If $\mathcal{C}$ is locally finite, there
is a continuous function $f : X \to \mathbb{R}$ such that $f(x) > 0$ for all $x$, and $f(x) \le \epsilon_C$ for
$x \in C$.


Proof. Cover $X$ by open sets each of which intersects at most finitely many elements
of $\mathcal{C}$; index this collection of open sets so that it becomes an indexed family $\{U_\alpha\}_{\alpha \in J}$.
Choose a partition of unity $\{\phi_\alpha\}$ on $X$ dominated by $\{U_\alpha\}$. Given $\alpha$, let $\delta_\alpha$ be the
minimum of the numbers $\epsilon_C$, as $C$ ranges over all those elements of $\mathcal{C}$ that intersect
the support of $\phi_\alpha$; if there are no such elements of $\mathcal{C}$, set $\delta_\alpha = 1$. Then define
\[ f(x) = \sum \delta_\alpha \phi_\alpha(x). \]
Because all the numbers $\delta_\alpha$ are positive, so is $f$. We show that $f(x) \le \epsilon_C$ for $x \in C$.
It will suffice to show that for $x \in C$ and arbitrary $\alpha$, we have
\[ (*) \qquad \delta_\alpha \phi_\alpha(x) \le \epsilon_C \phi_\alpha(x); \]
then the desired inequality follows by summing, as $\sum \phi_\alpha(x) = 1$. If $x \notin \text{Support } \phi_\alpha$,
then inequality $(*)$ is trivial because $\phi_\alpha(x) = 0$. And if $x \in \text{Support } \phi_\alpha$ and $x \in C$,
then $C$ intersects the support of $\phi_\alpha$, so that $\delta_\alpha \le \epsilon_C$ by construction; thus $(*)$ holds. ∎
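
The summation step and the positivity claim can be written out as follows. This is a sketch in the notation of the proof; the index $\alpha_0$ is introduced here only for illustration, and the display assumes the amsmath package.

```latex
% The minimum defining \delta_\alpha exists: Support \phi_\alpha lies in U_\alpha,
% which meets only finitely many elements of \mathcal{C}.
%
% Fix C \in \mathcal{C} and x \in C. Summing inequality (*) over \alpha gives
\begin{align*}
f(x) = \sum_{\alpha} \delta_\alpha \phi_\alpha(x)
  \le \sum_{\alpha} \epsilon_C\, \phi_\alpha(x)
  = \epsilon_C \sum_{\alpha} \phi_\alpha(x)
  = \epsilon_C .
\end{align*}
% Positivity: since \sum_\alpha \phi_\alpha(x) = 1, some index \alpha_0 has
% \phi_{\alpha_0}(x) > 0, and every term of the sum is nonnegative, so
\begin{align*}
f(x) \ge \delta_{\alpha_0} \phi_{\alpha_0}(x) > 0 .
\end{align*}
```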


Exercises


1. Give an example to show that if $X$ is paracompact, it does not follow that for
every open covering $\mathcal{A}$ of $X$, there is a locally finite subcollection of $\mathcal{A}$ that
covers $X$.


2. (a) Show that the product of a paracompact space and a compact space is
paracompact. [Hint: Use the tube lemma.]
(b) Conclude that $S_\Omega$ is not paracompact.


3. Is every locally compact Hausdorff space paracompact?


4. (a) Show that if $X$ has the discrete topology, then $X$ is paracompact.
(b) Show that if $f : X \to Y$ is continuous and $X$ is paracompact, the subspace
$f(X)$ of $Y$ need not be paracompact.


5. Let $X$ be paracompact. We proved a “shrinking lemma” for arbitrary indexed
open coverings of $X$. Here is an “expansion lemma” for arbitrary locally finite
indexed families in $X$.

Lemma. Let $\{B_\alpha\}_{\alpha \in J}$ be a locally finite indexed family of subsets of the
paracompact Hausdorff space $X$. Then there is a locally finite indexed family $\{U_\alpha\}_{\alpha \in J}$
of open sets in $X$ such that $B_\alpha \subset U_\alpha$ for each $\alpha$.


6. (a) Let $X$ be a regular space. If $X$ is a countable union of compact subspaces
of $X$, then $X$ is paracompact.
(b) Show $\mathbb{R}^\infty$ is paracompact as a subspace of $\mathbb{R}^\omega$ in the box topology.


*7. Let $X$ be a regular space.
(a) If $X$ is a finite union of closed paracompact subspaces of $X$, then $X$ is
paracompact.
(b) If $X$ is a countable union of closed paracompact subspaces of $X$ whose
interiors cover $X$, show $X$ is paracompact.


8. Let $p : X \to Y$ be a perfect map. (See Exercise 7 of §31.)
(a) Show that if $Y$ is paracompact, so is $X$. [Hint: If $\mathcal{A}$ is an open covering of $X$,
find a locally finite open covering of $Y$ by sets $B$ such that $p^{-1}(B)$ can be
covered by finitely many elements of $\mathcal{A}$; then intersect $p^{-1}(B)$ with these
elements of $\mathcal{A}$.]
(b) Show that if $X$ is a paracompact Hausdorff space, then so is $Y$. [Hint: If $\mathcal{B}$
is a locally finite closed covering of $X$, then $\{p(B) \mid B \in \mathcal{B}\}$ is a locally
finite closed covering of $Y$.]


9. Let $G$ be a locally compact, connected topological group. Show that $G$ is
paracompact. [Hint: Let $U_1$ be a neighborhood of $e$ having compact closure. In
general, define $U_{n+1} = \bar{U}_n \cdot U_1$. Show the union of the sets $\bar{U}_n$ is both open and
closed in $G$.]

This result holds without assuming G is connected, but the proof requires more 
effort. 


10. Theorem. If $X$ is a Hausdorff space that is locally compact and paracompact,
then each component of $X$ has a countable basis.
Proof. If $X_0$ is a component of $X$, then $X_0$ is locally compact and paracompact.
Let $\mathcal{C}$ be a locally finite covering of $X_0$ by sets open in $X_0$ that have compact
closures. Let $U_1$ be a nonempty element of $\mathcal{C}$, and in general let $U_n$ be the union
of all elements of $\mathcal{C}$ that intersect $\bar{U}_{n-1}$. Show $\bar{U}_n$ is compact, and the sets $U_n$
cover $X_0$.


§42 The Smirnov Metrization Theorem 


The Nagata-Smirnov metrization theorem gives one set of necessary and sufficient
conditions for metrizability of a space. In this section we prove a theorem that gives
another such set of conditions. It is a corollary of the Nagata-Smirnov theorem and
was first proved by Smirnov.


Definition. A space $X$ is locally metrizable if every point $x$ of $X$ has a
neighborhood $U$ that is metrizable in the subspace topology.


Theorem 42.1 (Smirnov metrization theorem). A space $X$ is metrizable if and
only if it is a paracompact Hausdorff space that is locally metrizable.


Proof. Suppose that $X$ is metrizable. Then $X$ is locally metrizable; it is also
paracompact, by Theorem 41.4.

Conversely, suppose that X is a paracompact Hausdorff space that is locally metriz- 
able. We shall show that X has a basis that is countably locally finite. Since X is 
regular, it will then follow from the Nagata-Smirnov theorem that X is metrizable. 

The proof is an adaptation of the last part of the proof of Theorem 40.3. Cover $X$
by open sets that are metrizable; then choose a locally finite open refinement $\mathcal{C}$ of
this covering that covers $X$. Each element $C$ of $\mathcal{C}$ is metrizable; let the function
$d_C : C \times C \to \mathbb{R}$ be a metric that gives the topology of $C$. Given $x \in C$, let $B_C(x, \epsilon)$
denote the set of all points $y$ of $C$ such that $d_C(x, y) < \epsilon$. Being open in $C$, the set
$B_C(x, \epsilon)$ is also open in $X$.

Given $m \in \mathbb{Z}_+$, let $\mathcal{A}_m$ be the covering of $X$ by all these open balls of radius $1/m$;
that is, let
\[ \mathcal{A}_m = \{B_C(x, 1/m) \mid x \in C \text{ and } C \in \mathcal{C}\}. \]
Let $\mathcal{D}_m$ be a locally finite open refinement of $\mathcal{A}_m$ that covers $X$. (Here we use
paracompactness.) Let $\mathcal{D}$ be the union of the collections $\mathcal{D}_m$. Then $\mathcal{D}$ is countably locally
finite. We assert that $\mathcal{D}$ is a basis for $X$; our theorem follows.

Let $x$ be a point of $X$ and let $U$ be a neighborhood of $x$. We seek to find an
element $D$ of $\mathcal{D}$ such that $x \in D \subset U$. Now $x$ belongs to only finitely many elements
of $\mathcal{C}$, say to $C_1, \ldots, C_k$. Then $U \cap C_i$ is a neighborhood of $x$ in the set $C_i$, so there is
an $\epsilon_i > 0$ such that
\[ B_{C_i}(x, \epsilon_i) \subset (U \cap C_i). \]
Choose $m$ so that $2/m < \min\{\epsilon_1, \ldots, \epsilon_k\}$. Because the collection $\mathcal{D}_m$ covers $X$, there
must be an element $D$ of $\mathcal{D}_m$ containing $x$. Because $\mathcal{D}_m$ refines $\mathcal{A}_m$, there must be
an element $B_C(y, 1/m)$ of $\mathcal{A}_m$, for some $C \in \mathcal{C}$ and some $y \in C$, that contains $D$.
Because
\[ x \in D \subset B_C(y, 1/m), \]
the point $x$ belongs to $C$, so that $C$ must be one of the sets $C_1, \ldots, C_k$. Say $C = C_i$.
Since $B_C(y, 1/m)$ has diameter at most $2/m < \epsilon_i$, it follows that
\[ x \in D \subset B_{C_i}(y, 1/m) \subset B_{C_i}(x, \epsilon_i) \subset U, \]
as desired. ∎
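
The middle inclusion in the final chain rests on the triangle inequality in the metric $d_{C_i}$. A sketch of that step, with the point $z$ introduced here only for illustration and the amsmath package assumed:

```latex
% Let z \in B_{C_i}(y, 1/m). Since x also lies in B_{C_i}(y, 1/m),
\begin{align*}
d_{C_i}(x, z) \le d_{C_i}(x, y) + d_{C_i}(y, z)
  < \frac{1}{m} + \frac{1}{m} = \frac{2}{m} < \epsilon_i ,
\end{align*}
% so z \in B_{C_i}(x, \epsilon_i). Hence
\begin{align*}
B_{C_i}(y, 1/m) \subset B_{C_i}(x, \epsilon_i) \subset U \cap C_i \subset U .
\end{align*}
```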


Exercises 


1. Compare Theorem 42.1 with Exercises 7 and 8 of §34. 


2. (a) Show that for each $x \in S_\Omega$, the section of $S_\Omega$ by $x$ has a countable basis and
hence is metrizable.
(b) Conclude that $S_\Omega$ is not paracompact.


Chapter 7 


Complete Metric Spaces 
and Function Spaces 


The concept of completeness for a metric space is one you may have seen already. It is 
basic for all aspects of analysis. Although completeness is a metric property rather than 
a topological one, there are a number of theorems involving complete metric spaces 
that are topological in character. In this chapter, we shall study the most important 
examples of complete metric spaces and shall prove some of these theorems. 

The most familiar example of a complete metric space is euclidean space in either
of its usual metrics. Another example, just as important, is the set $\mathcal{C}(X, Y)$ of all
continuous functions mapping a space $X$ into a metric space $Y$. This set has a metric
called the uniform metric, analogous to the uniform metric defined for $\mathbb{R}^J$ in §20. If $Y$
is a complete metric space, then $\mathcal{C}(X, Y)$ is complete in the uniform metric. This we
demonstrate in §43. As an application, we construct in §44 the well-known Peano
space-filling curve.

One theorem of topological character concerning complete metric spaces is a
theorem relating compactness of a space to completeness. We prove it in §45. An
immediate corollary is a theorem concerning compact subspaces of the function space
$\mathcal{C}(X, \mathbb{R}^n)$; it is the classical version of a famous theorem called Ascoli’s theorem.

There are other useful topologies on the function space $\mathcal{C}(X, Y)$ besides the one
derived from the uniform metric. We study some of them in §46, leading to a proof of
a general version of Ascoli’s theorem in §47.



§43 Complete Metric Spaces 


In this section we define the notion of completeness and show that if $Y$ is a complete
metric space, then the function space $\mathcal{C}(X, Y)$ is complete in the uniform metric. We
also show that every metric space can be imbedded isometrically in a complete metric
space.


Definition. Let $(X, d)$ be a metric space. A sequence $(x_n)$ of points of $X$ is said to
be a Cauchy sequence in $(X, d)$ if it has the property that given $\epsilon > 0$, there is an
integer $N$ such that
\[ d(x_n, x_m) < \epsilon \quad \text{whenever } n, m \ge N. \]
The metric space $(X, d)$ is said to be complete if every Cauchy sequence in $X$
converges.


Any convergent sequence in $X$ is necessarily a Cauchy sequence, of course;
completeness requires that the converse hold.

Note that a closed subset A of a complete metric space (X, d) is necessarily com- 
plete in the restricted metric. For a Cauchy sequence in A is also a Cauchy sequence 
in X, hence it converges in X. Because A is a closed subset of X, the limit must lie in 
A. 

Note also that if $X$ is complete under the metric $d$, then $X$ is complete under the
standard bounded metric

    $$\bar d(x, y) = \min\{d(x, y), 1\}$$

corresponding to $d$, and conversely. For a sequence $(x_n)$ is a Cauchy sequence under $\bar d$
if and only if it is a Cauchy sequence under $d$. And a sequence converges under $\bar d$ if
and only if it converges under $d$.

A useful criterion for a metric space to be complete is the following: 


Lemma 43.1. A metric space $X$ is complete if every Cauchy sequence in $X$ has a
convergent subsequence.


Proof. Let $(x_n)$ be a Cauchy sequence in $(X, d)$. We show that if $(x_n)$ has a
subsequence $(x_{n_i})$ that converges to a point $x$, then the sequence $(x_n)$ itself converges
to $x$.

Given $\epsilon > 0$, first choose $N$ large enough that

    $$d(x_n, x_m) < \epsilon/2$$

for all $n, m \geq N$ [using the fact that $(x_n)$ is a Cauchy sequence]. Then choose an
integer $i$ large enough that $n_i \geq N$ and

    $$d(x_{n_i}, x) < \epsilon/2$$

[using the fact that $n_1 < n_2 < \cdots$ is an increasing sequence of integers and $x_{n_i}$
converges to $x$]. Putting these facts together, we have the desired result that for $n \geq N$,

    $$d(x_n, x) \leq d(x_n, x_{n_i}) + d(x_{n_i}, x) < \epsilon.$$ ∎


Theorem 43.2. Euclidean space $\mathbb{R}^k$ is complete in either of its usual metrics, the
euclidean metric $d$ or the square metric $\rho$.


Proof. To show the metric space $(\mathbb{R}^k, \rho)$ is complete, let $(x_n)$ be a Cauchy sequence
in $(\mathbb{R}^k, \rho)$. Then the set $\{x_n\}$ is a bounded subset of $(\mathbb{R}^k, \rho)$. For if we choose $N$ so
that

    $$\rho(x_n, x_m) \leq 1$$

for all $n, m \geq N$, then the number

    $$M = \max\{\rho(x_1, 0), \ldots, \rho(x_{N-1}, 0), \rho(x_N, 0) + 1\}$$

is an upper bound for $\rho(x_n, 0)$. Thus the points of the sequence $(x_n)$ all lie in the cube
$[-M, M]^k$. Since this cube is compact, the sequence $(x_n)$ has a convergent
subsequence, by Theorem 28.2. Then $(\mathbb{R}^k, \rho)$ is complete, by Lemma 43.1.

To show that $(\mathbb{R}^k, d)$ is complete, note that a sequence is a Cauchy sequence
relative to $d$ if and only if it is a Cauchy sequence relative to $\rho$, and a sequence converges
relative to $d$ if and only if it converges relative to $\rho$. ∎


Now we deal with the product space $\mathbb{R}^\omega$. We need a lemma about sequences in a
product space.


Lemma 43.3. Let $X$ be the product space $X = \prod X_\alpha$; let $x_n$ be a sequence of points
of $X$. Then $x_n \to x$ if and only if $\pi_\alpha(x_n) \to \pi_\alpha(x)$ for each $\alpha$.


Proof. This result was given as an exercise in §19; we give a proof here. Because the
projection mapping $\pi_\alpha : X \to X_\alpha$ is continuous, it preserves convergent sequences;
the "only if" part of the lemma follows. To prove the converse, suppose $\pi_\alpha(x_n) \to
\pi_\alpha(x)$ for each $\alpha \in J$. Let $U = \prod U_\alpha$ be a basis element for $X$ that contains $x$. For
each $\alpha$ for which $U_\alpha$ does not equal the entire space $X_\alpha$, choose $N_\alpha$ so that $\pi_\alpha(x_n) \in
U_\alpha$ for $n \geq N_\alpha$. Let $N$ be the largest of the numbers $N_\alpha$; then for all $n \geq N$, we have
$x_n \in U$. ∎


Theorem 43.4. There is a metric for the product space $\mathbb{R}^\omega$ relative to which $\mathbb{R}^\omega$ is
complete.


Proof. Let $\bar d(a, b) = \min\{|a - b|, 1\}$ be the standard bounded metric on $\mathbb{R}$. Let $D$ be
the metric on $\mathbb{R}^\omega$ defined by

    $$D(x, y) = \sup\{\bar d(x_i, y_i)/i\}.$$

Then $D$ induces the product topology on $\mathbb{R}^\omega$; we verify that $\mathbb{R}^\omega$ is complete under $D$.
Let $x_n$ be a Cauchy sequence in $(\mathbb{R}^\omega, D)$. Because

    $$\bar d(\pi_i(x), \pi_i(y)) \leq i\,D(x, y),$$

we see that for fixed $i$ the sequence $\pi_i(x_n)$ is a Cauchy sequence in $\mathbb{R}$, so it converges,
say to $a_i$. Then, by Lemma 43.3, the sequence $x_n$ converges to the point
$a = (a_1, a_2, \ldots)$ of $\mathbb{R}^\omega$. ∎


EXAMPLE 1. An example of a noncomplete metric space is the space $\mathbb{Q}$ of rational
numbers in the usual metric $d(x, y) = |x - y|$. For instance, the sequence

    1.4, 1.41, 1.414, 1.4142, 1.41421, ...

of finite decimals converging (in $\mathbb{R}$) to $\sqrt{2}$ is a Cauchy sequence in $\mathbb{Q}$ that does not
converge (in $\mathbb{Q}$).
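
The Cauchy property of this sequence can be checked with exact rational arithmetic. The following is a minimal sketch, not part of the text: the helper names `sqrt2_truncation` and `max_gap` are hypothetical, and the check covers only finitely many terms, so it illustrates the definition rather than proving anything.

```python
from fractions import Fraction
from math import isqrt


def sqrt2_truncation(k: int) -> Fraction:
    """Decimal truncation of sqrt(2) to k places, as an exact rational."""
    scale = 10 ** k
    # isqrt(2 * scale^2) is the integer part of sqrt(2) * scale.
    return Fraction(isqrt(2 * scale * scale), scale)


def max_gap(N: int, upto: int) -> Fraction:
    """Largest |x_n - x_m| over N <= n, m <= upto (hypothetical helper)."""
    terms = [sqrt2_truncation(k) for k in range(N, upto + 1)]
    return max(terms) - min(terms)


# Terms from index N onward stay within 10^(-N) of one another.
for N in range(1, 8):
    assert max_gap(N, 40) < Fraction(1, 10 ** N)

# Every term is rational and its square is strictly less than 2.
assert all(sqrt2_truncation(k) ** 2 < 2 for k in range(1, 40))
```

The terms cluster as the definition requires, yet no rational number squares to 2, so there is nothing in $\mathbb{Q}$ for the sequence to converge to.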


EXAMPLE 2. Another noncomplete space is the open interval $(-1, 1)$ in $\mathbb{R}$, in the metric
$d(x, y) = |x - y|$. In this space the sequence $(x_n)$ defined by

    $$x_n = 1 - 1/n$$

is a Cauchy sequence that does not converge. This example shows that completeness is
not a topological property, that is, it is not preserved by homeomorphisms. For $(-1, 1)$ is
homeomorphic to the real line $\mathbb{R}$, and $\mathbb{R}$ is complete in its usual metric.
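
A short computation makes the failure concrete. This sketch assumes one particular homeomorphism of $(-1, 1)$ with $\mathbb{R}$, namely $h(x) = x/(1 - |x|)$; that choice and the names `h` and `x_seq` are illustrative and do not come from the text.

```python
from fractions import Fraction


def h(x: Fraction) -> Fraction:
    """An assumed homeomorphism (-1, 1) -> R, chosen for illustration."""
    return x / (1 - abs(x))


def x_seq(n: int) -> Fraction:
    """The sequence x_n = 1 - 1/n from the example."""
    return 1 - Fraction(1, n)


# The sequence is Cauchy in (-1, 1): terms past index N are within 1/N.
for N in range(1, 20):
    assert all(abs(x_seq(n) - x_seq(m)) < Fraction(1, N)
               for n in range(N, 60) for m in range(N, 60))

# Its image under h is 0, 1, 2, ..., which is not Cauchy in R.
assert all(h(x_seq(n)) == n - 1 for n in range(1, 60))
```

The homeomorphism carries a Cauchy sequence to one whose consecutive terms stay a distance 1 apart, which is why completeness cannot be a topological property.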


Although both the product spaces $\mathbb{R}^n$ and $\mathbb{R}^\omega$ have metrics relative to which they
are complete, one cannot hope to prove the same result for the product space $\mathbb{R}^J$ in
general, because $\mathbb{R}^J$ is not even metrizable if $J$ is uncountable (see §21). There is,
however, another topology on the set $\mathbb{R}^J$, the one given by the uniform metric. Relative
to this metric, $\mathbb{R}^J$ is complete, as we shall see.

We define the uniform metric in general as follows:


Definition. Let $(Y, d)$ be a metric space; let $\bar d(a, b) = \min\{d(a, b), 1\}$ be the
standard bounded metric on $Y$ derived from $d$. If $\mathbf{x} = (x_\alpha)_{\alpha \in J}$ and $\mathbf{y} = (y_\alpha)_{\alpha \in J}$ are points
of the cartesian product $Y^J$, let

    $$\bar\rho(\mathbf{x}, \mathbf{y}) = \sup\{\bar d(x_\alpha, y_\alpha) \mid \alpha \in J\}.$$

It is easy to check that $\bar\rho$ is a metric; it is called the uniform metric on $Y^J$
corresponding to the metric $d$ on $Y$.


Here we have used the standard "tuple" notation for the elements of the cartesian
product $Y^J$. Since the elements of $Y^J$ are simply functions from $J$ to $Y$, we could
also use functional notation for them. In this chapter, functional notation will be more
convenient than tuple notation, so we shall use it throughout. In this notation, the
definition of the uniform metric takes the following form: If $f, g : J \to Y$, then

    $$\bar\rho(f, g) = \sup\{\bar d(f(\alpha), g(\alpha)) \mid \alpha \in J\}.$$


Theorem 43.5. If the space $Y$ is complete in the metric $d$, then the space $Y^J$ is
complete in the uniform metric $\bar\rho$ corresponding to $d$.


Proof. Recall that if $(Y, d)$ is complete, so is $(Y, \bar d)$, where $\bar d$ is the bounded metric
corresponding to $d$. Now suppose that $f_1, f_2, \ldots$ is a sequence of points of $Y^J$ that is
a Cauchy sequence relative to $\bar\rho$. Given $\alpha$ in $J$, the fact that

    $$\bar d(f_n(\alpha), f_m(\alpha)) \leq \bar\rho(f_n, f_m)$$

for all $n, m$ means that the sequence $f_1(\alpha), f_2(\alpha), \ldots$ is a Cauchy sequence in $(Y, \bar d)$.
Hence this sequence converges, say to a point $y_\alpha$. Let $f : J \to Y$ be the function
defined by $f(\alpha) = y_\alpha$. We assert that the sequence $(f_n)$ converges to $f$ in the metric $\bar\rho$.

Given $\epsilon > 0$, first choose $N$ large enough that $\bar\rho(f_n, f_m) < \epsilon/2$ whenever
$n, m \geq N$. Then, in particular,

    $$\bar d(f_n(\alpha), f_m(\alpha)) < \epsilon/2$$

for $n, m \geq N$ and $\alpha \in J$. Letting $n$ and $\alpha$ be fixed, and letting $m$ become arbitrarily
large, we see that

    $$\bar d(f_n(\alpha), f(\alpha)) \leq \epsilon/2.$$

This inequality holds for all $\alpha$ in $J$, provided merely that $n \geq N$. Therefore,

    $$\bar\rho(f_n, f) \leq \epsilon/2 < \epsilon$$

for $n \geq N$, as desired. ∎


Now let us specialize somewhat, and consider the set $Y^X$ where $X$ is a topological
space rather than merely a set. Of course, this has no effect on what has gone before;
the topology of $X$ is irrelevant when considering the set of all functions $f : X \to Y$.
But suppose that we consider the subset $\mathcal{C}(X, Y)$ of $Y^X$ consisting of all continuous
functions $f : X \to Y$. It turns out that if $Y$ is complete, this subset is also complete
in the uniform metric. The same holds for the set $\mathcal{B}(X, Y)$ of all bounded functions
$f : X \to Y$. (A function $f$ is said to be bounded if its image $f(X)$ is a bounded
subset of the metric space $(Y, d)$.)


Theorem 43.6. Let $X$ be a topological space and let $(Y, d)$ be a metric space. The
set $\mathcal{C}(X, Y)$ of continuous functions is closed in $Y^X$ under the uniform metric. So is
the set $\mathcal{B}(X, Y)$ of bounded functions. Therefore, if $Y$ is complete, these spaces are
complete in the uniform metric.


Proof. The first part of this theorem is just the uniform limit theorem (Theorem 21.6)
in a new guise. First, we show that if a sequence of elements $f_n$ of $Y^X$ converges to
the element $f$ of $Y^X$ relative to the metric $\bar\rho$ on $Y^X$, then it converges to $f$ uniformly
in the sense defined in §21, relative to the metric $\bar d$ on $Y$. Given $\epsilon > 0$, choose an
integer $N$ such that

    $$\bar\rho(f, f_n) < \epsilon$$

for all $n > N$. Then for all $x \in X$ and all $n > N$,

    $$\bar d(f_n(x), f(x)) \leq \bar\rho(f_n, f) < \epsilon.$$

Thus $(f_n)$ converges uniformly to $f$.

Now we show that $\mathcal{C}(X, Y)$ is closed in $Y^X$ relative to the metric $\bar\rho$. Let $f$ be
an element of $Y^X$ that is a limit point of $\mathcal{C}(X, Y)$. Then there is a sequence $(f_n)$ of
elements of $\mathcal{C}(X, Y)$ converging to $f$ in the metric $\bar\rho$. By the uniform limit theorem,
$f$ is continuous, so that $f \in \mathcal{C}(X, Y)$.

Finally, we show that $\mathcal{B}(X, Y)$ is closed in $Y^X$. If $f$ is a limit point of $\mathcal{B}(X, Y)$,
there is a sequence of elements $f_n$ of $\mathcal{B}(X, Y)$ converging to $f$. Choose $N$ so large
that $\bar\rho(f_N, f) < 1/2$. Then for $x \in X$, we have $\bar d(f_N(x), f(x)) < 1/2$, which implies
that $d(f_N(x), f(x)) < 1/2$. It follows that if $M$ is the diameter of the set $f_N(X)$, then
$f(X)$ has diameter at most $M + 1$. Hence $f \in \mathcal{B}(X, Y)$.

We conclude that $\mathcal{C}(X, Y)$ and $\mathcal{B}(X, Y)$ are complete in the metric $\bar\rho$ if $Y$ is
complete in $d$. ∎


Definition. If $(Y, d)$ is a metric space, one can define another metric on the set
$\mathcal{B}(X, Y)$ of bounded functions from $X$ to $Y$ by the equation

    $$\rho(f, g) = \sup\{d(f(x), g(x)) \mid x \in X\}.$$

It is easy to see that $\rho$ is well-defined, for the set $f(X) \cup g(X)$ is bounded if both $f(X)$
and $g(X)$ are. The metric $\rho$ is called the sup metric.


There is a simple relation between the sup metric and the uniform metric. Indeed,
if $f, g \in \mathcal{B}(X, Y)$, then

    $$\bar\rho(f, g) = \min\{\rho(f, g), 1\}.$$

For if $\rho(f, g) > 1$, then $d(f(x_0), g(x_0)) > 1$ for at least one $x_0 \in X$, so that
$\bar d(f(x_0), g(x_0)) = 1$ and $\bar\rho(f, g) = 1$ by definition. On the other hand, if $\rho(f, g) \leq 1$,
then $\bar d(f(x), g(x)) = d(f(x), g(x)) \leq 1$ for all $x$, so that $\bar\rho(f, g) = \rho(f, g)$. Thus on
$\mathcal{B}(X, Y)$, the metric $\bar\rho$ is just the standard bounded metric derived from the metric $\rho$.
That is the reason we introduced the notation $\bar\rho$ for the uniform metric, back in §20!
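
The identity relating the two metrics can be checked numerically on a small case. The sketch below makes simplifying assumptions that are not in the text: $X$ is a finite set of rationals (so each supremum is a maximum), $Y = \mathbb{R}$ with $d(a, b) = |a - b|$, and the three sample functions are arbitrary.

```python
from fractions import Fraction


def d(a, b):
    """Usual metric on the rationals, standing in for (Y, d)."""
    return abs(a - b)


def d_bar(a, b):
    """Standard bounded metric derived from d."""
    return min(d(a, b), Fraction(1))


def rho(f, g, X):
    """Sup metric; X is finite here, so the sup is a max."""
    return max(d(f(x), g(x)) for x in X)


def rho_bar(f, g, X):
    """Uniform metric on the same finite domain."""
    return max(d_bar(f(x), g(x)) for x in X)


X = [Fraction(k, 4) for k in range(9)]          # 0, 1/4, ..., 2
square = lambda x: x * x
identity = lambda x: x
shifted = lambda x: x * x + Fraction(1, 3)

# Case rho > 1: the uniform metric is capped at 1.
assert rho(square, identity, X) == 2
assert rho_bar(square, identity, X) == 1

# Case rho <= 1: the two metrics agree.
assert rho(square, shifted, X) == rho_bar(square, shifted, X) == Fraction(1, 3)
```

Both branches of the argument appear: one pair of functions is more than 1 apart somewhere, the other pair is uniformly within 1.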

If $X$ is a compact space, then every continuous function $f : X \to Y$ is bounded;
hence the sup metric is defined on $\mathcal{C}(X, Y)$. If $Y$ is complete under $d$, then $\mathcal{C}(X, Y)$
is complete under the corresponding uniform metric $\bar\rho$, so it is also complete under
the sup metric $\rho$. We often use the sup metric rather than the uniform metric in this
situation.

We now prove a classical theorem, to the effect that every metric space can be
imbedded isometrically in a complete metric space. (A different proof, somewhat
more direct, is outlined in Exercise 9.) Although we shall not need this theorem, it is
useful in other parts of mathematics.


*Theorem 43.7. Let $(X, d)$ be a metric space. There is an isometric imbedding of $X$
into a complete metric space.


Proof. Let $\mathcal{B}(X, \mathbb{R})$ be the set of all bounded functions mapping $X$ into $\mathbb{R}$. Let $x_0$ be
a fixed point of $X$. Given $a \in X$, define $\phi_a : X \to \mathbb{R}$ by the equation

    $$\phi_a(x) = d(x, a) - d(x, x_0).$$

We assert that $\phi_a$ is bounded. For it follows, from the inequalities

    $$d(x, a) \leq d(x, b) + d(a, b),$$
    $$d(x, b) \leq d(x, a) + d(a, b),$$

that

    $$|d(x, a) - d(x, b)| \leq d(a, b).$$

Setting $b = x_0$, we conclude that $|\phi_a(x)| \leq d(a, x_0)$ for all $x$.
Define $\Phi : X \to \mathcal{B}(X, \mathbb{R})$ by setting

    $$\Phi(a) = \phi_a.$$

We show that $\Phi$ is an isometric imbedding of $(X, d)$ into the complete metric space
$(\mathcal{B}(X, \mathbb{R}), \rho)$. That is, we show that for every pair of points $a, b \in X$,

    $$\rho(\phi_a, \phi_b) = d(a, b).$$

By definition,

    $$\rho(\phi_a, \phi_b) = \sup\{|\phi_a(x) - \phi_b(x)| ;\ x \in X\}
                           = \sup\{|d(x, a) - d(x, b)| ;\ x \in X\}.$$

We conclude that

    $$\rho(\phi_a, \phi_b) \leq d(a, b).$$

On the other hand, this inequality cannot be strict, for when $x = a$,

    $$|d(x, a) - d(x, b)| = d(a, b).$$ ∎
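
The construction in this proof is easy to test on a finite metric space. The sketch below is only an illustration under assumptions of its own: the five sample points, the taxicab metric on them, and the names `phi` and `rho` are chosen here and are not part of the text.

```python
from itertools import product

# A small finite metric space: points of the integer grid, taxicab metric.
points = [(0, 0), (3, 1), (-2, 4), (5, -3), (1, 1)]
x0 = points[0]  # the fixed base point of the construction


def d(p, q):
    """Taxicab metric (an assumed example metric)."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def phi(a):
    """The function phi_a(x) = d(x, a) - d(x, x0)."""
    return lambda x: d(x, a) - d(x, x0)


def rho(f, g):
    """Sup metric on functions; a max because the space is finite."""
    return max(abs(f(x) - g(x)) for x in points)


# Each phi_a is bounded by d(a, x0).
assert all(abs(phi(a)(x)) <= d(a, x0) for a, x in product(points, repeat=2))

# The map a -> phi_a preserves distances exactly.
assert all(rho(phi(a), phi(b)) == d(a, b) for a, b in product(points, repeat=2))
```

The supremum is attained at $x = a$, exactly as in the last line of the proof, which is why the check holds with equality.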


Definition. Let $X$ be a metric space. If $h : X \to Y$ is an isometric imbedding of $X$
into a complete metric space $Y$, then the subspace $\overline{h(X)}$ of $Y$ (the closure of $h(X)$ in $Y$)
is a complete metric space. It is called the completion of $X$.


The completion of $X$ is uniquely determined up to an isometry. See Exercise 10.


Exercises


1. Let $X$ be a metric space.

(a) Suppose that for some $\epsilon > 0$, every $\epsilon$-ball in $X$ has compact closure. Show
that $X$ is complete.

(b) Suppose that for each $x \in X$ there is an $\epsilon > 0$ such that the ball $B(x, \epsilon)$
has compact closure. Show by means of an example that $X$ need not be
complete.

2. Let $(X, d_X)$ and $(Y, d_Y)$ be metric spaces; let $Y$ be complete. Let $A \subset X$. Show
that if $f : A \to Y$ is uniformly continuous, then $f$ can be uniquely extended to
a continuous function $g : \bar A \to Y$, and $g$ is uniformly continuous.

3. Two metrics $d$ and $d'$ on a set $X$ are said to be metrically equivalent if the identity
map $i : (X, d) \to (X, d')$ and its inverse are both uniformly continuous.

(a) Show that $d$ is metrically equivalent to the standard bounded metric $\bar d$
derived from $d$.

(b) Show that if $d$ and $d'$ are metrically equivalent, then $X$ is complete under $d$
if and only if it is complete under $d'$.

4. Show that the metric space $(X, d)$ is complete if and only if for every nested
sequence $A_1 \supset A_2 \supset \cdots$ of nonempty closed sets of $X$ such that $\operatorname{diam} A_n \to 0$,
the intersection of the sets $A_n$ is nonempty.

5. If $(X, d)$ is a metric space, recall that a map $f : X \to X$ is called a contraction
if there is a number $\alpha < 1$ such that

    $$d(f(x), f(y)) \leq \alpha\, d(x, y)$$

for all $x, y \in X$. Show that if $f$ is a contraction of a complete metric space, then
there is a unique point $x \in X$ such that $f(x) = x$. Compare Exercise 7 of §28.


6. A space $X$ is said to be topologically complete if there exists a metric for the
topology of $X$ relative to which $X$ is complete.

(a) Show that a closed subspace of a topologically complete space is
topologically complete.

(b) Show that a countable product of topologically complete spaces is
topologically complete (in the product topology).

(c) Show that an open subspace of a topologically complete space is
topologically complete. [Hint: If $U \subset X$ and $X$ is complete under the metric $d$,
define $\phi : U \to \mathbb{R}$ by the equation

    $$\phi(x) = 1/d(x, X - U).$$

Imbed $U$ in $X \times \mathbb{R}$ by setting $f(x) = x \times \phi(x)$.]

(d) Show that if $A$ is a $G_\delta$ set in a topologically complete space, then $A$ is
topologically complete. [Hint: Let $A$ be the intersection of the open sets
$U_n$, for $n \in \mathbb{Z}_+$. Consider the diagonal imbedding $f(a) = (a, a, \ldots)$ of $A$
into $\prod U_n$.] Conclude that the irrationals are topologically complete.

7. Show that the set of all sequences $(x_1, x_2, \ldots)$ such that $\sum x_i^2$ converges is
complete in the $\ell^2$-metric. (See Exercise 8 of §20.)

8. If $X$ and $Y$ are spaces, define

    $$e : X \times \mathcal{C}(X, Y) \to Y$$

by the equation $e(x, f) = f(x)$; the map $e$ is called the evaluation map. Show
that if $d$ is a metric for $Y$ and $\mathcal{C}(X, Y)$ has the corresponding uniform topology,
then $e$ is continuous. We shall generalize this result in §46.

9. Let $(X, d)$ be a metric space. Show that there is an isometric imbedding $h$ of $X$
into a complete metric space $(Y, D)$, as follows: Let $\tilde X$ denote the set of all
Cauchy sequences

    $$\mathbf{x} = (x_1, x_2, \ldots)$$

of points of $X$. Define $\mathbf{x} \sim \mathbf{y}$ if

    $$d(x_n, y_n) \to 0.$$

Let $[\mathbf{x}]$ denote the equivalence class of $\mathbf{x}$; and let $Y$ denote the set of these
equivalence classes. Define a metric $D$ on $Y$ by the equation

    $$D([\mathbf{x}], [\mathbf{y}]) = \lim_{n \to \infty} d(x_n, y_n).$$

(a) Show that $\sim$ is an equivalence relation, and show that $D$ is a well-defined
metric.

(b) Define $h : X \to Y$ by letting $h(x)$ be the equivalence class of the constant
sequence $(x, x, \ldots)$:

    $$h(x) = [(x, x, \ldots)].$$

Show that $h$ is an isometric imbedding.

(c) Show that $h(X)$ is dense in $Y$; indeed, given $\mathbf{x} = (x_1, x_2, \ldots) \in \tilde X$, show
the sequence $h(x_n)$ of points of $Y$ converges to the point $[\mathbf{x}]$ of $Y$.

(d) Show that if $A$ is a dense subset of a metric space $(Z, \rho)$, and if every Cauchy
sequence in $A$ converges in $Z$, then $Z$ is complete.

(e) Show that $(Y, D)$ is complete.

10. Theorem (Uniqueness of the completion). Let $h : X \to Y$ and $h' : X \to Y'$ be
isometric imbeddings of the metric space $(X, d)$ in the complete metric spaces
$(Y, D)$ and $(Y', D')$, respectively. Then there is an isometry of $(\overline{h(X)}, D)$ with
$(\overline{h'(X)}, D')$ that equals $h'h^{-1}$ on the subspace $h(X)$.


*§44 A Space-Filling Curve 


As an application of the completeness of the metric space $\mathcal{C}(X, Y)$ in the uniform
metric when $Y$ is complete, we shall construct the famous "Peano space-filling curve."


Theorem 44.1. Let $I = [0, 1]$. There exists a continuous map $f : I \to I^2$ whose
image fills up the entire square $I^2$.


The existence of this path violates one's naive geometric intuition in much the
same way as does the existence of the continuous nowhere-differentiable function
(which we shall come to later).


Proof. Step 1. We shall construct the map $f$ as the limit of a sequence of continuous
functions $f_n$. First we describe a particular operation on paths, which will be used to
generate the sequence $f_n$.

Begin with an arbitrary closed interval $[a, b]$ in the real line and an arbitrary square
in the plane with sides parallel to the coordinate axes, and consider the triangular path $g$
pictured in Figure 44.1. It is a continuous map of $[a, b]$ into the square. The operation
we wish to describe replaces the path $g$ by the path $g'$ pictured in Figure 44.2. It is
made up of four triangular paths, each half the size of $g$. Note that $g$ and $g'$ have the
same initial point and the same final point. You can write the equations for $g$ and $g'$ if
you like.


[Figure 44.1: the triangular path g on the interval [a, b]]

[Figure 44.2: the path g']


This same operation can also be applied to any triangular path connecting two
adjacent corners of the square. For instance, when applied to the path $h$ pictured in
Figure 44.3, it gives the path $h'$.

Step 2. Now we define a sequence of functions $f_n : I \to I^2$. The first function,
which we label $f_0$ for convenience, is the triangular path pictured in Figure 44.1, letting
$a = 0$ and $b = 1$. The next function $f_1$ is the function obtained by applying the
operation described in Step 1 to the function $f_0$; it is pictured in Figure 44.2. The next
function $f_2$ is the function obtained by applying this same operation to each of the four
triangular paths that make up $f_1$. It is pictured in Figure 44.4. The next function $f_3$
is obtained by applying the operation to each of the 16 triangular paths that make up
$f_2$; it is pictured in Figure 44.5. And so on. At the general step, $f_n$ is a path made
up of $4^n$ triangular paths of the type considered in Step 1, each lying in a square of
edge length $1/2^n$. The function $f_{n+1}$ is obtained by applying the operation of Step 1
to these triangular paths, replacing each one by four smaller triangular paths.

[Figure 44.3: the paths h and h']

[Figure 44.5: the path f_3]

Step 3. For purposes of this proof, let $d(x, y)$ denote the square metric on $\mathbb{R}^2$,

    $$d(x, y) = \max\{|x_1 - y_1|, |x_2 - y_2|\}.$$

Then we can let $\rho$ denote the corresponding sup metric on $\mathcal{C}(I, I^2)$:

    $$\rho(f, g) = \sup\{d(f(t), g(t)) \mid t \in I\}.$$

Because $I^2$ is closed in $\mathbb{R}^2$, it is complete in the square metric; then $\mathcal{C}(I, I^2)$ is
complete in the metric $\rho$.

We assert that the sequence of functions $(f_n)$ defined in Step 2 is a Cauchy
sequence under $\rho$. To prove this fact, let us examine what happens when we pass from $f_n$
to $f_{n+1}$. Each of the small triangular paths that make up $f_n$ lies in a square of edge
length $1/2^n$. The operation by which we obtain $f_{n+1}$ replaces each such triangular
path by four triangular paths that lie in the same square. Therefore, in the square
metric on $I^2$, the distance between $f_n(t)$ and $f_{n+1}(t)$ is at most $1/2^n$. As a result,
$\rho(f_n, f_{n+1}) \leq 1/2^n$. It follows that $(f_n)$ is a Cauchy sequence, since

    $$\rho(f_n, f_{n+m}) \leq 1/2^n + 1/2^{n+1} + \cdots + 1/2^{n+m-1} < 2/2^n$$

for all $n$ and $m$.
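
The geometric-series bound used in the Cauchy estimate can be confirmed with exact arithmetic. This is a small sanity check rather than part of the proof; the helper name `tail_sum` is hypothetical.

```python
from fractions import Fraction


def tail_sum(n: int, m: int) -> Fraction:
    """1/2^n + 1/2^(n+1) + ... + 1/2^(n+m-1), computed exactly."""
    return sum(Fraction(1, 2 ** k) for k in range(n, n + m))


for n in range(0, 12):
    for m in range(1, 25):
        s = tail_sum(n, m)
        # Closed form of the finite geometric sum.
        assert s == Fraction(2, 2 ** n) - Fraction(2, 2 ** (n + m))
        # The bound used in the proof, independent of m.
        assert s < Fraction(2, 2 ** n)
```

Since the bound $2/2^n$ does not depend on $m$ and tends to 0 as $n$ grows, the sequence $(f_n)$ is Cauchy under the sup metric.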

Step 4. Because $\mathcal{C}(I, I^2)$ is complete, the sequence $f_n$ converges to a continuous
function $f : I \to I^2$. We prove that $f$ is surjective.

Let $x$ be a point of $I^2$; we show that $x$ belongs to $f(I)$. First we note that, given $n$,
the path $f_n$ comes within a distance of $1/2^n$ of the point $x$. For the path $f_n$ touches
each of the little squares of edge length $1/2^n$ into which we have divided $I^2$.

Using this fact, we shall prove that, given $\epsilon > 0$, the $\epsilon$-neighborhood of $x$
intersects $f(I)$. Choose $N$ large enough that

    $$\rho(f_N, f) < \epsilon/2 \quad \text{and} \quad 1/2^N < \epsilon/2.$$

By the result of the previous paragraph, there is a point $t_0 \in I$ such that
$d(x, f_N(t_0)) \leq 1/2^N$. Then since $d(f_N(t), f(t)) < \epsilon/2$ for all $t$, it follows that

    $$d(x, f(t_0)) < \epsilon,$$

so the $\epsilon$-neighborhood of $x$ intersects $f(I)$.
It follows that $x$ belongs to the closure of $f(I)$. But $I$ is compact, so $f(I)$ is
compact and is therefore closed. Hence $x$ lies in $f(I)$, as desired. ∎


Exercises 


1. Given $n$, show there is a continuous surjective map $g : I \to I^n$. [Hint: Consider
$f \times f : I \times I \to I^2 \times I^2$.]

2. Show there is a continuous surjective map $f : \mathbb{R} \to \mathbb{R}^n$.

3. (a) If $\mathbb{R}^\omega$ is given the product topology, show there is no continuous surjective
map $f : \mathbb{R} \to \mathbb{R}^\omega$. [Hint: Show that $\mathbb{R}^\omega$ is not a countable union of compact
subspaces.]

(b) If $\mathbb{R}^\omega$ is given the product topology, determine whether or not there is a
continuous surjective map of $\mathbb{R}$ onto the subspace $\mathbb{R}^\infty$.

(c) What happens to the statements in (a) and (b) if $\mathbb{R}^\omega$ is given the uniform
topology or the box topology?

4. (a) Let $X$ be a Hausdorff space. Show that if there is a continuous surjective
map $f : I \to X$, then $X$ is compact, connected, weakly locally connected,
and metrizable. [Hint: Show $f$ is a perfect map.]

(b) The converse of the result in (a) is a famous theorem of point-set topology
called the Hahn-Mazurkiewicz theorem (see [H-Y], p. 129). Assuming this
theorem, show there is a continuous surjective map $f : I \to I^\omega$.

A Hausdorff space that is the continuous image of the closed unit interval is
often called a Peano space.


§45 Compactness in Metric Spaces


We have already shown that compactness, limit point compactness, and sequential
compactness are equivalent for metric spaces. There is still another formulation of
compactness for metric spaces, one that involves the notion of completeness. We
study it in this section. As an application, we shall prove a theorem characterizing
those subspaces of $\mathcal{C}(X, \mathbb{R}^n)$ that are compact in the uniform topology.

How is compactness of a metric space X related to completeness of X? It follows 
from Lemma 43.1 that every compact metric space is complete. The converse does not 
hold—a complete metric space need not be compact. It is reasonable to ask what extra 
condition one needs to impose on a complete space to be assured of its compactness. 
Such a condition is the one called total boundedness. 


Definition. A metric space $(X, d)$ is said to be totally bounded if for every $\epsilon > 0$,
there is a finite covering of $X$ by $\epsilon$-balls.
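
As an illustration of the definition, the following sketch builds a finite family of $\epsilon$-ball centers for the open interval $(-1, 1)$ in the usual metric $|a - b|$. The function name `epsilon_net` and the choice of centers (integer multiples of $\epsilon$) are assumptions made here for the example; the coverage check samples only finitely many points.

```python
import math
from fractions import Fraction


def epsilon_net(eps: Fraction) -> list:
    """Centers k*eps lying in (-1, 1); their open eps-balls cover (-1, 1)."""
    k_max = math.ceil(1 / eps) - 1          # largest k with k*eps < 1
    return [k * eps for k in range(-k_max, k_max + 1)]


def is_covered(x: Fraction, centers: list, eps: Fraction) -> bool:
    """True if x lies in the open eps-ball about some center."""
    return any(abs(x - c) < eps for c in centers)


for eps in (Fraction(1, 2), Fraction(1, 4), Fraction(3, 10), Fraction(1, 7)):
    centers = epsilon_net(eps)
    # Sample points of (-1, 1) on a fine grid; each lies in some ball.
    assert all(is_covered(Fraction(j, 1000), centers, eps)
               for j in range(-999, 1000))
```

The number of centers grows roughly like $2/\epsilon$ but is finite for every $\epsilon$, which is what total boundedness asks. For $\mathbb{R}$ itself no finite list of centers could work, since finitely many $\epsilon$-balls have bounded union.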


EXAMPLE 1. Total boundedness clearly implies boundedness. For if $B(x_1, 1/2), \ldots,
B(x_n, 1/2)$ is a finite covering of $X$ by open balls of radius 1/2, then $X$ has diameter at
most $1 + \max\{d(x_i, x_j)\}$. The converse does not hold, however. For example, in the metric
$\bar d(a, b) = \min\{1, |a - b|\}$, the real line $\mathbb{R}$ is bounded but not totally bounded.


EXAMPLE 2. Under the metric $d(a, b) = |a - b|$, the real line $\mathbb{R}$ is complete but
not totally bounded, while the subspace $(-1, 1)$ is totally bounded but not complete. The
subspace $[-1, 1]$ is both complete and totally bounded.


Theorem 45.1. A metric space $(X, d)$ is compact if and only if it is complete and
totally bounded.


Proof. If $X$ is a compact metric space, then $X$ is complete, as noted above. The fact
that $X$ is totally bounded is a consequence of the fact that the covering of $X$ by all
open $\epsilon$-balls must contain a finite subcovering.

Conversely, let $X$ be complete and totally bounded. We shall prove that $X$ is
sequentially compact. This will suffice.

Let $(x_n)$ be a sequence of points of $X$. We shall construct a subsequence of $(x_n)$
that is a Cauchy sequence, so that it necessarily converges. First cover $X$ by finitely
many balls of radius 1. At least one of these balls, say $B_1$, contains $x_n$ for infinitely
many values of $n$. Let $J_1$ be the subset of $\mathbb{Z}_+$ consisting of those indices $n$ for which
$x_n \in B_1$.

Next, cover $X$ by finitely many balls of radius 1/2. Because $J_1$ is infinite, at
least one of these balls, say $B_2$, must contain $x_n$ for infinitely many values of $n$ in $J_1$.
Choose $J_2$ to be the set of those indices $n$ for which $n \in J_1$ and $x_n \in B_2$. In general,
given an infinite set $J_k$ of positive integers, choose $J_{k+1}$ to be an infinite subset of $J_k$
such that there is a ball $B_{k+1}$ of radius $1/(k + 1)$ that contains $x_n$ for all $n \in J_{k+1}$.

Choose $n_1 \in J_1$. Given $n_k$, choose $n_{k+1} \in J_{k+1}$ such that $n_{k+1} > n_k$; this we
can do because $J_{k+1}$ is an infinite set. Now for $i, j \geq k$, the indices $n_i$ and $n_j$ both
belong to $J_k$ (because $J_1 \supset J_2 \supset \cdots$ is a nested sequence of sets). Therefore, for all
$i, j \geq k$, the points $x_{n_i}$ and $x_{n_j}$ are contained in a ball $B_k$ of radius $1/k$. It follows that
the sequence $(x_{n_i})$ is a Cauchy sequence, as desired. ∎


We now apply this result to find the compact subspaces of the space $\mathcal{C}(X, \mathbb{R}^n)$, in
the uniform topology. We know that a subspace of $\mathbb{R}^n$ is compact if and only if it is
closed and bounded. One might hope that an analogous result holds for $\mathcal{C}(X, \mathbb{R}^n)$. But
it does not, even if $X$ is compact. One needs to assume that the subspace of $\mathcal{C}(X, \mathbb{R}^n)$
satisfies an additional condition, called equicontinuity. We consider that notion now.


Definition. Let $(Y, d)$ be a metric space. Let $\mathcal{F}$ be a subset of the function space
$\mathcal{C}(X, Y)$. If $x_0 \in X$, the set $\mathcal{F}$ of functions is said to be equicontinuous at $x_0$ if given
$\epsilon > 0$, there is a neighborhood $U$ of $x_0$ such that for all $x \in U$ and all $f \in \mathcal{F}$,

    $$d(f(x), f(x_0)) < \epsilon.$$

If the set $\mathcal{F}$ is equicontinuous at $x_0$ for each $x_0 \in X$, it is said simply to be
equicontinuous.


Continuity of the function $f$ at $x_0$ means that given $f$ and given $\epsilon > 0$, there exists
a neighborhood $U$ of $x_0$ such that $d(f(x), f(x_0)) < \epsilon$ for $x \in U$. Equicontinuity
of $\mathcal{F}$ means that a single neighborhood $U$ can be chosen that will work for all the
functions $f$ in the collection $\mathcal{F}$.
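
A standard family that is continuous but not equicontinuous makes the distinction visible. The example below is an assumption introduced for illustration and does not come from the text: take $X = [0, 1]$, $Y = \mathbb{R}$ with the usual metric, and the family $f_n(x) = x^n$, examined at $x_0 = 1$. The computation uses floating point, so it is a numerical illustration only.

```python
def worst_oscillation(delta: float, n_max: int) -> float:
    """max over 1 <= n <= n_max of |f_n(1 - delta) - f_n(1)|, f_n(x) = x**n."""
    x = 1.0 - delta
    return max(abs(x ** n - 1.0) for n in range(1, n_max + 1))


for delta in (0.1, 0.01, 0.001):
    # A single function: shrinking the neighborhood shrinks the oscillation.
    assert worst_oscillation(delta, 1) < 2 * delta
    # The whole family: some f_n still moves by more than 1/2 near x0 = 1.
    assert worst_oscillation(delta, 10_000) > 0.5
```

However small the neighborhood of $x_0 = 1$, some member of the family changes by more than $1/2$ on it, so no single neighborhood works for $\epsilon = 1/2$ across the whole collection.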

Note that equicontinuity depends on the specific metric $d$ rather than merely on
the topology of $Y$.


Lemma 45.2. Let $X$ be a space; let $(Y, d)$ be a metric space. If the subset $\mathcal{F}$
of $\mathcal{C}(X, Y)$ is totally bounded under the uniform metric corresponding to $d$, then $\mathcal{F}$ is
equicontinuous under $d$.


Proof. Assume $\mathcal{F}$ is totally bounded. Given $0 < \epsilon < 1$, and given $x_0$, we find a
neighborhood $U$ of $x_0$ such that $d(f(x), f(x_0)) < \epsilon$ for $x \in U$ and $f \in \mathcal{F}$.
Set $\delta = \epsilon/3$; cover $\mathcal{F}$ by finitely many open $\delta$-balls

    $$B(f_1, \delta), \ldots, B(f_n, \delta)$$

in $\mathcal{C}(X, Y)$. Each function $f_i$ is continuous; therefore, we can choose a
neighborhood $U$ of $x_0$ such that for $i = 1, \ldots, n$,

    $$d(f_i(x), f_i(x_0)) < \delta$$

whenever $x \in U$.
Let $f$ be an arbitrary element of $\mathcal{F}$. Then $f$ belongs to at least one of the above
$\delta$-balls, say to $B(f_i, \delta)$. Then for $x \in U$, we have

    $$\bar d(f(x), f_i(x)) < \delta,$$
    $$d(f_i(x), f_i(x_0)) < \delta,$$
    $$\bar d(f_i(x_0), f(x_0)) < \delta.$$

The first and third inequalities hold because $\bar\rho(f, f_i) < \delta$, and the second holds
because $x \in U$. Since $\delta < 1$, the first and third also hold if $\bar d$ is replaced by $d$; then the
triangle inequality implies that for all $x \in U$, we have $d(f(x), f(x_0)) < \epsilon$, as desired. ∎


Now we prove the classical version of Ascoli's theorem, which concerns compact
subspaces of the function space $\mathcal{C}(X, \mathbb{R}^n)$. A more general version, whose proof does
not depend on this one, is given in §47. The general version, however, relies on the
Tychonoff theorem, whereas this one does not.

We begin by proving a partial converse to the preceding lemma, which holds
when $X$ and $Y$ are compact.


*Lemma 45.3. Let X be a space; let (Y, d) be a metric space; assume X and Y are 
compact. If the subset F of C(X, Y) is equicontinuous under d, then F is totally 
bounded under the uniform and sup metrics corresponding to d. 


Proof. Since $X$ is compact, the sup metric $\rho$ is defined on $\mathcal{C}(X, Y)$. Total
boundedness under $\rho$ is equivalent to total boundedness under $\bar\rho$, for whenever $\epsilon < 1$, every
$\epsilon$-ball under $\rho$ is also an $\epsilon$-ball under $\bar\rho$, and conversely. Therefore, we may as well
use the metric $\rho$ throughout.

Assume $\mathcal{F}$ is equicontinuous. Given $\epsilon > 0$, we cover $\mathcal{F}$ by finitely many sets that
are open $\epsilon$-balls in the metric $\rho$.

Set $\delta = \epsilon/3$. Given any $a \in X$, there is a corresponding neighborhood $U_a$ of $a$
such that $d(f(x), f(a)) < \delta$ for all $x \in U_a$ and all $f \in \mathcal{F}$. Cover $X$ by finitely
many such neighborhoods $U_a$, for $a = a_1, \ldots, a_k$; denote $U_{a_i}$ by $U_i$. Then cover $Y$ by
finitely many open sets $V_1, \ldots, V_m$ of diameter less than $\delta$.

Let $J$ be the collection of all functions $\alpha : \{1, \ldots, k\} \to \{1, \ldots, m\}$. Given $\alpha \in J$,
if there exists a function $f$ of $\mathcal{F}$ such that $f(a_i) \in V_{\alpha(i)}$ for each $i = 1, \ldots, k$, choose
one such function and label it $f_\alpha$. The collection $\{f_\alpha\}$ is indexed by a subset $J'$ of the
set $J$ and is thus finite. We assert that the open balls $B_\rho(f_\alpha, \epsilon)$, for $\alpha \in J'$, cover $\mathcal{F}$.

Let $f$ be an element of $\mathcal{F}$. For each $i = 1, \ldots, k$, choose an integer $\alpha(i)$ such
that $f(a_i) \in V_{\alpha(i)}$. Then the function $\alpha$ is in $J'$. We assert that $f$ belongs to the ball
$B_\rho(f_\alpha, \epsilon)$.

Let $x$ be a point of $X$. Choose $i$ so that $x \in U_i$. Then

    $$d(f(x), f(a_i)) < \delta,$$
    $$d(f(a_i), f_\alpha(a_i)) < \delta,$$
    $$d(f_\alpha(a_i), f_\alpha(x)) < \delta.$$

The first and third inequalities hold because $x \in U_i$, and the second holds because
$f(a_i)$ and $f_\alpha(a_i)$ are in $V_{\alpha(i)}$. We conclude that $d(f(x), f_\alpha(x)) < \epsilon$. Because this
inequality holds for every $x \in X$,

    $$\rho(f, f_\alpha) = \max\{d(f(x), f_\alpha(x))\} < \epsilon.$$

Thus $f$ belongs to $B_\rho(f_\alpha, \epsilon)$, as asserted. ∎


Definition. If $(Y, d)$ is a metric space, a subset $\mathcal{F}$ of $\mathcal{C}(X, Y)$ is said to be pointwise
bounded under $d$ if for each $a \in X$, the subset

    $$\mathcal{F}_a = \{f(a) \mid f \in \mathcal{F}\}$$

of $Y$ is bounded under $d$.


*Theorem 45.4 (Ascoli's theorem, classical version). Let $X$ be a compact space;
let $(\mathbb{R}^n, d)$ denote euclidean space in either the square metric or the euclidean metric;
give $\mathcal{C}(X, \mathbb{R}^n)$ the corresponding uniform topology. A subspace $\mathcal{F}$ of $\mathcal{C}(X, \mathbb{R}^n)$ has
compact closure if and only if $\mathcal{F}$ is equicontinuous and pointwise bounded under $d$.


Proof. Since $X$ is compact, the sup metric $\rho$ is defined on $\mathcal{C}(X, \mathbb{R}^n)$ and gives
the uniform topology on $\mathcal{C}(X, \mathbb{R}^n)$. Throughout, let $\mathcal{G}$ denote the closure of $\mathcal{F}$ in
$\mathcal{C}(X, \mathbb{R}^n)$.

Step 1. We show that if $\mathcal{G}$ is compact, then $\mathcal{G}$ is equicontinuous and pointwise
bounded under $d$. Since $\mathcal{F} \subset \mathcal{G}$, it follows that $\mathcal{F}$ is also equicontinuous and pointwise
bounded under $d$. This proves the "only if" part of the theorem.

Compactness of $\mathcal{G}$ implies that $\mathcal{G}$ is totally bounded under $\rho$ and $\bar\rho$, by
Theorem 45.1; this in turn implies that $\mathcal{G}$ is equicontinuous under $d$, by Lemma 45.2.
Compactness of $\mathcal{G}$ also implies that $\mathcal{G}$ is bounded under $\rho$; this in turn implies that $\mathcal{G}$ is
pointwise bounded under $d$. For if $\rho(f, g) \leq M$ for all $f, g \in \mathcal{G}$, then in particular
$d(f(a), g(a)) \leq M$ for $f, g \in \mathcal{G}$, so that $\mathcal{G}_a$ has diameter at most $M$.


Step 2. We show that if $\mathcal{F}$ is equicontinuous and pointwise bounded under $d$, then
so is $\mathcal{G}$.

First, we check equicontinuity. Given $x_0 \in X$ and given $\epsilon > 0$, choose a
neighborhood $U$ of $x_0$ such that $d(f(x), f(x_0)) < \epsilon/3$ for all $x \in U$ and $f \in \mathcal{F}$. Given
$g \in \mathcal{G}$, choose $f \in \mathcal{F}$ so that $\rho(f, g) < \epsilon/3$. The triangle inequality implies that
$d(g(x), g(x_0)) < \epsilon$ for all $x \in U$. Since $g$ is arbitrary, equicontinuity of $\mathcal{G}$ at $x_0$
follows.

Second, we verify pointwise boundedness. Given $a$, choose $M$ so that
$\operatorname{diam} \mathcal{F}_a \leq M$. Then, given $g, g' \in \mathcal{G}$, choose $f, f' \in \mathcal{F}$ such that $\rho(f, g) < 1$ and $\rho(f', g') < 1$.
Since $d(f(a), f'(a)) \leq M$, it follows that $d(g(a), g'(a)) \leq M + 2$. Then since $g$
and $g'$ are arbitrary, it follows that $\operatorname{diam} \mathcal{G}_a \leq M + 2$.


Step 3. We show that if G is equicontinuous and pointwise bounded, then there is
a compact subspace Y of R^n that contains the union of the sets g(X), for g ∈ G.

Choose, for each a ∈ X, a neighborhood U_a of a such that d(g(x), g(a)) < 1
for x ∈ U_a and g ∈ G. Since X is compact, we can cover X by finitely many such
neighborhoods, say for a = a_1, ..., a_k. Because the sets G_{a_i} are bounded, their union
is also bounded; suppose it lies in the ball of radius N in R^n centered at the origin.
Then for all g ∈ G, the set g(X) is contained in the ball of radius N + 1 centered at
the origin. Let Y be the closure of this ball.
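
The bound N + 1 comes from one application of the triangle inequality. As a sketch, writing G for the closure of F and 0 for the origin of R^n: given g ∈ G and x ∈ X, pick an index i with x ∈ U_{a_i}; then

```latex
\[
d(g(x), 0) \le d(g(x), g(a_i)) + d(g(a_i), 0) < 1 + N .
\]
```

A closed ball in R^n is closed and bounded, hence compact, so it can serve as Y.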


Step 4. We prove the “if” part of the theorem. Assume that F is equicontinuous
and pointwise bounded under d. We show that G is complete and totally bounded
under ρ; then Theorem 45.1 implies that G is compact.

Completeness is easy, for G is a closed subspace of the complete metric space
(C(X, R^n), ρ).

We verify total boundedness. First, Step 2 implies that G is equicontinuous and
pointwise bounded under d; then Step 3 tells us that there is a compact subspace Y
of R^n such that G ⊂ C(X, Y). Equicontinuity of G now implies, by Lemma 45.3, that
G is totally bounded under ρ, as desired. ∎


*Corollary 45.5. Let X be compact; let d denote either the square metric or the
euclidean metric on R^n; give C(X, R^n) the corresponding uniform topology. A subspace
F of C(X, R^n) is compact if and only if it is closed, bounded under the sup
metric ρ, and equicontinuous under d.


Proof. If F is compact, it must be closed and bounded; the preceding theorem implies
that it is also equicontinuous. Conversely, if F is closed, it equals its closure G; if
it is bounded under ρ, it is pointwise bounded under d; and if it is also equicontinuous,
the preceding theorem implies that it is compact. ∎


Exercises

1. If X_n is metrizable with metric d_n, then
D(x, y) = sup{d̄_i(x_i, y_i)/i}
is a metric for the product space X = ∏ X_n. Show that X is totally bounded
under D if each X_n is totally bounded under d_n. Conclude without using the
Tychonoff theorem that a countable product of compact metrizable spaces is compact.
2. Let (Y, d) be a metric space; let F be a subset of C(X, Y).

(a) Show that if F is finite, then F is equicontinuous.

(b) Show that if f_n is a sequence of elements of C(X, Y) that converges
uniformly, then the collection {f_n} is equicontinuous.

(c) Suppose that F is a collection of differentiable functions f : R → R such
that each x ∈ R lies in a neighborhood U on which the derivatives of the
functions in F are uniformly bounded. [This means that there is an M such
that |f'(x)| ≤ M for all f in F and all x ∈ U.] Show that F is equicontinuous.


3. Prove the following:

Theorem (Arzela’s theorem). Let X be compact; let f_n ∈ C(X, R^k). If the
collection {f_n} is pointwise bounded and equicontinuous, then the sequence f_n
has a uniformly convergent subsequence.


4. (a) Let f_n : I → R be the function f_n(x) = x^n. The collection F = {f_n}
is pointwise bounded but the sequence (f_n) has no uniformly convergent
subsequence; at what point or points does F fail to be equicontinuous?
(b) Repeat (a) for the functions f_n of Exercise 9 of §21.


5. Let X be a space. A subset F of C(X, R) is said to vanish uniformly at infinity
if given ε > 0, there is a compact subspace C of X such that |f(x)| < ε for
x ∈ X − C and f ∈ F. If F consists of a single function f, we say simply
that f vanishes at infinity. Let C_0(X, R) denote the set of continuous functions
f : X → R that vanish at infinity.

Theorem. Let X be locally compact Hausdorff; give C_0(X, R) the uniform
topology. A subset F of C_0(X, R) has compact closure if and only if it is pointwise
bounded, equicontinuous, and vanishes uniformly at infinity.

[Hint: Let Y denote the one-point compactification of X. Show that C_0(X, R)
is isometric with a closed subspace of C(Y, R) if both are given the sup metric.]


6. Show that our proof of Ascoli’s theorem goes through if R^n is replaced by any
metric space in which all closed bounded subspaces are compact.


*7. Let (X, d) be a metric space. If A ⊂ X and ε > 0, let U(A, ε) be the
ε-neighborhood of A. Let H be the collection of all (nonempty) closed, bounded
subsets of X. If A, B ∈ H, define

D(A, B) = inf{ε | A ⊂ U(B, ε) and B ⊂ U(A, ε)}.

(a) Show that D is a metric on H; it is called the Hausdorff metric.

(b) Show that if (X, d) is complete, so is (H, D). [Hint: Let A_n be a Cauchy
sequence in H; by passing to a subsequence, assume D(A_n, A_{n+1}) < 1/2^n.
Define A to be the set of all points x that are the limits of sequences x_1, x_2, ...
such that x_i ∈ A_i for each i and d(x_i, x_{i+1}) < 1/2^i. Show A_n → Ā.]

(c) Show that if (X, d) is totally bounded, so is (H, D). [Hint: Given ε, choose
δ < ε and let S be a finite subset of X such that the collection {B_d(x, δ) |
x ∈ S} covers X. Let 𝒜 be the collection of all nonempty subsets of S; show
that {B_D(A, ε) | A ∈ 𝒜} covers H.]

(d) Theorem. If X is compact in the metric d, then the space H is compact in
the Hausdorff metric D.

*8. Let (X, d_X) and (Y, d_Y) be metric spaces; give X × Y the corresponding square
metric; let H denote the collection of all nonempty closed, bounded subsets of
X × Y in the resulting Hausdorff metric. Consider the space C(X, Y) in the
uniform metric; let gr : C(X, Y) → H be the function that assigns, to each
continuous function f : X → Y, its graph

G_f = {x × f(x) | x ∈ X}.

(a) Show that the map gr is injective and uniformly continuous.

(b) Let H_0 denote the image set of the map gr; let g : C(X, Y) → H_0 be the
surjective map obtained from gr. Show that if f : X → Y is uniformly
continuous, then the map g^{-1} is continuous at the point G_f.

(c) Give an example where g^{-1} is not continuous at the point G_f.

(d) Theorem. If X is compact, then gr : C(X, Y) → H is an imbedding.


§46 Pointwise and Compact Convergence 


There are other useful topologies on the spaces Y^X and C(X, Y) in addition to the
uniform topology. We shall consider three of them here; they are called the topology
of pointwise convergence, the topology of compact convergence, and the compact-open
topology.


Definition. Given a point x of the set X and an open set U of the space Y, let

S(x, U) = {f | f ∈ Y^X and f(x) ∈ U}.

The sets S(x, U) are a subbasis for a topology on Y^X, which is called the topology of
pointwise convergence (or the point-open topology).


The general basis element for this topology is a finite intersection of subbasis
elements S(x, U). Thus a typical basis element about the function f consists of all
functions g that are “close” to f at finitely many points. Such a neighborhood is
illustrated in Figure 46.1; it consists of all functions g whose graphs intersect the three
vertical intervals pictured.

Figure 46.1

The topology of pointwise convergence on Y^X is nothing new. It is just the product
topology we have already studied. If we replace X by J and denote the general element
of J by α to make it look more familiar, then the set S(α, U) of all functions x : J → Y
such that x(α) ∈ U is just the subset π_α^{-1}(U) of Y^J, which is the standard subbasis
element for the product topology.

The reason for calling it the topology of pointwise convergence comes from the 
following theorem: 


Theorem 46.1. A sequence f_n of functions converges to the function f in the topology
of pointwise convergence if and only if for each x in X, the sequence f_n(x) of
points of Y converges to the point f(x).


Proof. This result is just a reformulation, in function space notation, of a standard
result about the product topology proved as Lemma 43.3. ∎


EXAMPLE 1. Consider the space R^I, where I = [0, 1]. The sequence (f_n) of continuous
functions given by f_n(x) = x^n converges in the topology of pointwise convergence to the
function f defined by

f(x) = 0 for 0 ≤ x < 1,
f(x) = 1 for x = 1.

This example shows that the subspace C(I, R) of continuous functions is not closed in R^I
in the topology of pointwise convergence.
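
One way to see that the convergence in Example 1 is not uniform is to compute the sup distance from f_n to its pointwise limit f directly. The following short check uses only the formula f_n(x) = x^n on I = [0, 1] and the fact that x^n approaches 1 as x approaches 1 from the left:

```latex
\[
\sup_{x \in [0,1]} |f_n(x) - f(x)|
  = \sup_{0 \le x < 1} x^n
  = 1
  \qquad \text{for every } n .
\]
```

Since this distance does not tend to 0, the sequence converges pointwise but not uniformly, consistent with the discontinuity of the limit.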
We know that a sequence (f_n) of continuous functions that converges in the uniform
topology has a continuous limit, and the preceding example shows that a sequence
that converges only in the topology of pointwise convergence need not. One
can ask whether there is a topology intermediate between these two that will suffice
to ensure that the limit of a convergent sequence of continuous functions is continuous.
The answer is “yes”; assuming the (fairly mild) restriction that the space X be
compactly generated, it will suffice if f_n converges to f in the topology of compact
convergence, which we now define.


Definition. Let (Y, d) be a metric space; let X be a topological space. Given an
element f of Y^X, a compact subspace C of X, and a number ε > 0, let B_C(f, ε)
denote the set of all those elements g of Y^X for which

sup{d(f(x), g(x)) | x ∈ C} < ε.

The sets B_C(f, ε) form a basis for a topology on Y^X. It is called the topology of
compact convergence (or sometimes the “topology of uniform convergence on compact
sets”).


It is easy to show that the sets B_C(f, ε) satisfy the conditions for a basis. The
crucial step is to note that if g ∈ B_C(f, ε), then for

δ = ε − sup{d(f(x), g(x)) | x ∈ C},

we have B_C(g, δ) ⊂ B_C(f, ε).
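
The inclusion can be verified in one line. As a sketch, take δ to be ε minus the supremum of d(f(x), g(x)) over x ∈ C, as in the text, and let h be any element of the basis set centered at g with radius δ; then

```latex
\begin{align*}
\sup_{x \in C} d(f(x), h(x))
  &\le \sup_{x \in C} d(f(x), g(x)) + \sup_{x \in C} d(g(x), h(x)) \\
  &< \sup_{x \in C} d(f(x), g(x)) + \delta = \epsilon ,
\end{align*}
```

so h lies in the basis set centered at f with radius ε.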

The topology of compact convergence differs from the topology of pointwise convergence
in that the general basis element containing f consists of functions that are
“close” to f not just at finitely many points, but at all points of some compact set.

The justification for the choice of terminology comes from the following theorem,
whose proof is immediate.


Theorem 46.2. A sequence f_n : X → Y of functions converges to the function f
in the topology of compact convergence if and only if for each compact subspace C
of X, the sequence f_n|C converges uniformly to f|C.


Definition. A space X is said to be compactly generated if it satisfies the following
condition: A set A is open in X if A ∩ C is open in C for each compact subspace C
of X.


This condition is equivalent to requiring that a set B be closed in X if B ∩ C is
closed in C for each compact C. It is a fairly mild restriction on the space; many
familiar spaces are compactly generated. For instance:


Lemma 46.3. If X is locally compact, or if X satisfies the first countability axiom,
then X is compactly generated.

Proof. Suppose that X is locally compact. Let A ∩ C be open in C for every compact
subspace C of X. We show A is open in X. Given x ∈ A, choose a neighborhood U
of x that lies in a compact subspace C of X. Since A ∩ C is open in C by hypothesis,
A ∩ U is open in U, and hence open in X. Then A ∩ U is a neighborhood of x contained
in A, so that A is open in X.

Suppose that X satisfies the first countability axiom. If B ∩ C is closed in C for
each compact subspace C of X, we show that B is closed in X. Let x be a point of the
closure B̄; we show that x ∈ B. Since X has a countable basis at x, there is a sequence
(x_n) of points of B converging to x. The subspace

C = {x} ∪ {x_n | n ∈ Z_+}

is compact, so that B ∩ C is by assumption closed in C. Since B ∩ C contains x_n for
every n, it contains x as well. Therefore, x ∈ B, as desired. ∎


The crucial fact about compactly generated spaces is the following: 


Lemma 46.4. If X is compactly generated, then a function f : X → Y is continuous
if for each compact subspace C of X, the restricted function f|C is continuous.


Proof. Let V be an open subset of Y; we show that f^{-1}(V) is open in X. Given any
subspace C of X,

f^{-1}(V) ∩ C = (f|C)^{-1}(V).

If C is compact, this set is open in C because f|C is continuous. Since X is compactly
generated, it follows that f^{-1}(V) is open in X. ∎


Theorem 46.5. Let X be a compactly generated space; let (Y, d) be a metric space.
Then C(X, Y) is closed in Y^X in the topology of compact convergence.


Proof. Let f ∈ Y^X be a limit point of C(X, Y); we wish to show f is continuous.
It suffices to show that f|C is continuous for each compact subspace C of X. For
each n, consider the neighborhood B_C(f, 1/n) of f; it intersects C(X, Y), so we can
choose a function f_n ∈ C(X, Y) lying in this neighborhood. The sequence of functions
f_n|C : C → Y converges uniformly to the function f|C, so that by the uniform limit
theorem, f|C is continuous. ∎


Corollary 46.6. Let X be a compactly generated space; let (Y, d) be a metric space.
If a sequence of continuous functions f_n : X → Y converges to f in the topology of
compact convergence, then f is continuous.


Now we have three topologies for the function space Y^X when Y is metric. The
relation between them is stated in the following theorem, whose proof is straightforward.

Theorem 46.7. Let X be a space; let (Y, d) be a metric space. For the function
space Y^X, one has the following inclusions of topologies:

(uniform) ⊃ (compact convergence) ⊃ (pointwise convergence).

If X is compact, the first two coincide, and if X is discrete, the second two coincide.
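
The two inclusions can be checked directly from the definitions. The following is one possible sketch, not necessarily the argument the text has in mind. It assumes that the uniform topology is the one induced by the uniform metric, written here as \bar\rho(f, g) = sup{min(d(f(x), g(x)), 1) | x ∈ X}, and it writes B_d(y, ε) for the ε-ball about y in Y.

```latex
% Compact convergence is finer than pointwise convergence:
% a one-point set {x} is compact, so if f \in S(x, U) and
% B_d(f(x), \epsilon) \subset U, then
\[
  f \in B_{\{x\}}(f, \epsilon) \subset S(x, U) .
\]
% Uniform is finer than compact convergence: put \delta = \min\{\epsilon, 1\}.
% If \bar\rho(f, g) < \delta, then d(f(x), g(x)) < 1 for every x, hence
\[
  \sup_{x \in C} d(f(x), g(x))
    \le \sup_{x \in X} d(f(x), g(x))
    = \bar\rho(f, g) < \delta \le \epsilon ,
\]
% so the \bar\rho-ball of radius \delta about f lies in B_C(f, \epsilon).
```

The second estimate is stated for the center f of the basis set; it suffices because every element of a basis set for compact convergence is itself the center of a smaller basis set contained in it.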


Now the definitions of the uniform topology and the compact convergence topol- 
ogy made specific use of the metric d for the space Y. But the topology of pointwise 
convergence did not; in fact, it is defined for any space Y. It is natural to ask whether 
either of these other topologies can be extended to the case where Y is an arbitrary 
topological space. There is no satisfactory answer to this question for the space Y^X
of all functions mapping X into Y. But for the subspace C(X, Y) of continuous func- 
tions, one can prove something. It turns out that there is in general a topology on 
C(X, Y), called the compact-open topology, that coincides with the compact conver- 
gence topology when Y is a metric space. This topology is important in its own right, 
as we shall see. 


Definition. Let X and Y be topological spaces. If C is a compact subspace of X 
and U is an open subset of Y, define 


S(C, U) = {f | f ∈ C(X, Y) and f(C) ⊂ U}.


The sets S(C, U) form a subbasis for a topology on C(X, Y) that is called the
compact-open topology.


It is clear from the definition that the compact-open topology is finer than the 
pointwise convergence topology. The compact-open topology can in fact be defined 
on the entire function space Y^X. It is, however, of interest only for the subspace
C(X, Y), so we shall consider it only for that space. 


Theorem 46.8. Let X be a space and let (Y, d) be a metric space. On the set C(X, Y),
the compact-open topology and the topology of compact convergence coincide.


Proof. If A is a subset of Y and ε > 0, let U(A, ε) be the ε-neighborhood of A.
If A is compact and V is an open set containing A, then there is an ε > 0 such
that U(A, ε) ⊂ V. Indeed, the minimum value on A of the function d(a, Y − V) is the
required ε.

We first prove that the topology of compact convergence is finer than the compact-open
topology. Let S(C, U) be a subbasis element for the compact-open topology, and
let f be an element of S(C, U). Because f is continuous, f(C) is a compact subset
of the open set U. Therefore, we can choose ε so that the ε-neighborhood of f(C) lies in
U. Then, as desired,

B_C(f, ε) ⊂ S(C, U).


Now we prove that the compact-open topology is finer than the topology of compact
convergence. Let f ∈ C(X, Y). Given an open set about f in the topology of
compact convergence, it contains a basis element of the form B_C(f, ε). We shall find
a basis element for the compact-open topology that contains f and lies in B_C(f, ε).

Each point x of X has a neighborhood V_x such that f(V̄_x) lies in an open set U_x
of Y having diameter less than ε. [For example, choose V_x so that f(V_x) lies in
the ε/4-neighborhood of f(x). Then f(V̄_x) lies in the ε/3-neighborhood of f(x),
which has diameter at most 2ε/3.] Cover C by finitely many such sets V_x, say for
x = x_1, ..., x_n. Let C_x = V̄_x ∩ C. Then C_x is compact, and the basis element

S(C_{x_1}, U_{x_1}) ∩ ··· ∩ S(C_{x_n}, U_{x_n})

contains f and lies in B_C(f, ε), as desired. ∎
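
The final claim can be spelled out as follows. This is a sketch in the notation of the proof, where V_{x_1}, ..., V_{x_n} cover C, each C_{x_i} is the intersection of C with the closure of V_{x_i}, and each U_{x_i} has diameter less than ε. Let g belong to the intersection of the sets S(C_{x_i}, U_{x_i}), and let x ∈ C. Choose i with x ∈ V_{x_i}; then x ∈ C_{x_i}, so both f(x) and g(x) lie in U_{x_i}, and

```latex
\[
d(f(x), g(x)) \le \operatorname{diam} U_{x_i}
  \le \max_{1 \le j \le n} \operatorname{diam} U_{x_j} < \epsilon .
\]
```

The bound on the right does not depend on x, so the supremum over C is also less than ε.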


Corollary 46.9. Let Y be a metric space. The compact convergence topology on
C(X, Y) does not depend on the metric of Y. Therefore if X is compact, the uniform
topology on C(X, Y) does not depend on the metric of Y.


The fact that the definition of the compact-open topology does not involve a metric
is just one of its useful features. Another is the fact that it satisfies the requirement
of “joint continuity.” Roughly speaking, this means that the expression f(x) is
continuous not only in the single “variable” x, but is continuous jointly in both the
“variables” x and f. More precisely, one has the following theorem:


Theorem 46.10. Let X be locally compact Hausdorff; let C(X, Y) have the compact-open
topology. Then the map

e : X × C(X, Y) → Y

defined by the equation

e(x, f) = f(x)

is continuous.


The map e is called the evaluation map.


Proof. Given a point (x, f) of X × C(X, Y) and an open set V in Y about the image
point e(x, f) = f(x), we wish to find an open set about (x, f) that e maps into V.
First, using the continuity of f and the fact that X is locally compact Hausdorff, we
can choose an open set U about x having compact closure Ū, such that f carries Ū
into V. Then consider the open set U × S(Ū, V) in X × C(X, Y). It is an open set
containing (x, f). And if (x', f') belongs to this set, then e(x', f') = f'(x') belongs
to V, as desired. ∎


A consequence of this theorem is the theorem that follows. It is useful in algebraic 
topology. 




Definition. Given a function f : X × Z → Y, there is a corresponding function
F : Z → C(X, Y), defined by the equation

(F(z))(x) = f(x, z).

Conversely, given F : Z → C(X, Y), this equation defines a corresponding function
f : X × Z → Y. We say that F is the map of Z into C(X, Y) that is induced by f.


*Theorem 46.11. Let X and Y be spaces; give C(X, Y) the compact-open topology.
If f : X × Z → Y is continuous, then so is the induced function F : Z → C(X, Y).
The converse holds if X is locally compact Hausdorff.


Proof. Suppose first that F is continuous and that X is locally compact Hausdorff. It
follows that f is continuous, since f equals the composite

X × Z --(i_X × F)--> X × C(X, Y) --(e)--> Y,

where i_X is the identity map of X.

Now suppose that f is continuous. To prove continuity of F, we take a point z_0
of Z and a subbasis element S(C, U) for C(X, Y) containing F(z_0), and find a
neighborhood W of z_0 that is mapped by F into S(C, U). This will suffice.

The statement that F(z_0) lies in S(C, U) means simply that (F(z_0))(x) = f(x, z_0)
is in U for all x ∈ C. That is, f(C × z_0) ⊂ U. Continuity of f implies that f^{-1}(U)
is an open set in X × Z containing C × z_0. Then

f^{-1}(U) ∩ (C × Z)

is an open set in the subspace C × Z containing the slice C × z_0. The tube lemma of §26
implies that there is a neighborhood W of z_0 in Z such that the entire tube C × W lies
in f^{-1}(U). See Figure 46.2. Then for z ∈ W and x ∈ C, we have f(x, z) ∈ U. Hence
F(W) ⊂ S(C, U), as desired. ∎

We discuss briefly the connections between the compact-open topology and the concept
of homotopy, which arises in algebraic topology.

If f and g are continuous maps of X into Y, we say that f and g are homotopic if
there is a continuous map

h : X × [0, 1] → Y

such that h(x, 0) = f(x) and h(x, 1) = g(x) for each x ∈ X. The map h is called a
homotopy between f and g.

Roughly speaking, a homotopy is a “continuous one-parameter family” of maps from
X to Y. More precisely, we note that a homotopy h gives rise to a map

H : [0, 1] → C(X, Y)

that assigns, to each parameter value t in [0, 1], the corresponding continuous map from X
to Y. Assuming that X is locally compact Hausdorff, we see that h is continuous if and only
if H is continuous. This means that a homotopy h between f and g corresponds precisely
to a path in the function space C(X, Y) from the point f of C(X, Y) to the point g.

We shall return to a more detailed study of homotopy in Part II of the book.
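
As a concrete illustration (an added example, not taken from the text above), suppose Y = R^n. Then any two continuous maps f, g : X → R^n are homotopic by the straight-line homotopy, and the corresponding path in the function space can be written down directly:

```latex
\[
  h(x, t) = (1 - t)\, f(x) + t\, g(x), \qquad
  h(x, 0) = f(x), \quad h(x, 1) = g(x),
\]
\[
  H(t) = (1 - t)\, f + t\, g \in \mathcal{C}(X, \mathbb{R}^n), \qquad
  H(0) = f, \quad H(1) = g .
\]
```

Here h is continuous because it is built from f, g, and the vector space operations of R^n, all of which are continuous.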


Exercises


1. Show that the sets B_C(f, ε) form a basis for a topology on Y^X.

2. Prove Theorem 46.7.

3. Show that the set B(R, R) of bounded functions f : R → R is closed in R^R in
the uniform topology, but not in the topology of compact convergence.

4. Consider the sequence of continuous functions f_n : R → R defined by
f_n(x) = x/n.
In which of the three topologies of Theorem 46.7 does this sequence converge?
Answer the same question for the sequence given in Exercise 9 of §21.

5. Consider the sequence of functions f_n : (−1, 1) → R, defined by

f_n(x) = Σ_{k=1}^{n} k x^k.

(a) Show that (f_n) converges in the topology of compact convergence; conclude
that the limit function is continuous. (This is a standard fact about power
series.)

(b) Show that (f_n) does not converge in the uniform topology.

6. Show that in the compact-open topology, C(X, Y) is Hausdorff if Y is Hausdorff,
and regular if Y is regular. [Hint: If Ū ⊂ V, then the closure of S(C, U) is
contained in S(C, V).]

7. Show that if Y is locally compact Hausdorff, then composition of maps

C(X, Y) × C(Y, Z) → C(X, Z)

is continuous, provided the compact-open topology is used throughout. [Hint: If
g ∘ f ∈ S(C, U), find V such that f(C) ⊂ V and g(V̄) ⊂ U.]


8. Let C'(X, Y) denote the set C(X, Y) in some topology T. Show that if the
evaluation map

e : X × C'(X, Y) → Y

is continuous, then T contains the compact-open topology. [Hint: The induced
map E : C'(X, Y) → C(X, Y) is continuous.]


9. Here is an (unexpected) application of Theorem 46.11 to quotient maps. (Compare
Exercise 11 of §29.)

Theorem. If p : A → B is a quotient map and X is locally compact Hausdorff,
then i_X × p : X × A → X × B is a quotient map.

Proof.

(a) Let Y be the quotient space induced by i_X × p; let q : X × A → Y be the
quotient map. Show there is a bijective continuous map f : Y → X × B
such that f ∘ q = i_X × p.

(b) Let g = f^{-1}. Let G : B → C(X, Y) and Q : A → C(X, Y) be the maps
induced by g and q, respectively. Show that Q = G ∘ p.

(c) Show that Q is continuous; conclude that G is continuous, so that g is
continuous.

10. A space X is locally compact if it can be covered by open sets each of which is
contained in a compact subspace of X. It is said to be σ-compact if it can be
covered by countably many such open sets.

(a) Show that if X is locally compact and second-countable, it is σ-compact.

(b) Let (Y, d) be a metric space. Show that if X is σ-compact, there is a metric
for the topology of compact convergence on Y^X such that if (Y, d) is
complete, Y^X is complete in this metric. [Hint: Let A_1, A_2, ... be a countable
collection of compact subspaces of X whose interiors cover X. Let Y_i
denote the set of all functions from A_i to Y, in the uniform topology. Define
a homeomorphism of Y^X with a closed subspace of the product space
Y_1 × Y_2 × ···.]

11. Let (Y, d) be a metric space; let X be a space. Define a topology on C(X, Y) as
follows: Given f ∈ C(X, Y), and given a positive continuous function δ : X → R_+
on X, let

B(f, δ) = {g | d(f(x), g(x)) < δ(x) for all x ∈ X}.

(a) Show that the sets B(f, δ) form a basis for a topology on C(X, Y). We call
it the fine topology.
(b) Show that the fine topology contains the uniform topology.
(c) Show that if X is compact, the fine and uniform topologies agree.
(d) Show that if X is discrete, then C(X, Y) = Y^X and the fine and box topologies
agree.


§47 Ascoli’s Theorem


Now we prove a more general version of Ascoli’s theorem. It characterizes the compact
subspaces of C(X, Y) in the topology of compact convergence. The proof, however,
involves all three of our standard function space topologies: the topology of
pointwise convergence, the topology of compact convergence, and the uniform topology.


Theorem 47.1 (Ascoli’s theorem). Let X be a space and let (Y, d) be a metric space.
Give C(X, Y) the topology of compact convergence; let F be a subset of C(X, Y).
(a) If F is equicontinuous under d and the set

F_a = {f(a) | f ∈ F}

has compact closure for each a ∈ X, then F is contained in a compact subspace
of C(X, Y).
(b) The converse holds if X is locally compact Hausdorff.


Proof of (a). Throughout, we give Y^X the product topology, which is the same as
the topology of pointwise convergence. Then Y^X is a Hausdorff space. The space
C(X, Y), which has the topology of compact convergence, is not a subspace of Y^X.
Let G be the closure of F in Y^X.

Step 1. We show that G is a compact subspace of Y^X. Given a ∈ X, let C_a denote
the closure of F_a in Y; by hypothesis, C_a is a compact subspace of Y. The set F is
contained in the product space

∏_{a ∈ X} C_a,

since this product by definition consists of all functions f : X → Y satisfying the
condition f(a) ∈ C_a for all a. This product space is compact, by the Tychonoff
theorem; it is a closed subspace of the product space Y^X. Because G equals the closure
of F in Y^X, the space G is contained in ∏ C_a; being closed, G is therefore compact.

Step 2. We show that each function belonging to G is continuous, and indeed that G
itself is equicontinuous under d.

Given x_0 ∈ X and ε > 0, choose a neighborhood U of x_0 such that

(*) d(f(x), f(x_0)) < ε/3 for all f ∈ F and all x ∈ U.

We shall show that d(g(x), g(x_0)) < ε for all g ∈ G and all x ∈ U; it follows that G is
equicontinuous.

Let g ∈ G and let x be a point of U. Define V_x to be the subset of Y^X, open in Y^X,
consisting of all elements h of Y^X such that

(**) d(h(x), g(x)) < ε/3 and d(h(x_0), g(x_0)) < ε/3.

Because g belongs to the closure of F, the neighborhood V_x of g must contain an
element f of F. Applying the triangle inequality to (*) and (**), it follows that
d(g(x), g(x_0)) < ε, as desired.

Step 3. We show that the product topology on Y^X and the compact convergence
topology on C(X, Y) coincide on the subset G.

In general, the compact convergence topology is finer than the product topology.
We prove that the reverse holds for the subset G. Let g be an element of G, and
let B_C(g, ε) be a basis element for the compact convergence topology on Y^X that
contains g. We find a basis element B for the pointwise convergence topology on Y^X
that contains g such that

[B ∩ G] ⊂ [B_C(g, ε) ∩ G].

Using equicontinuity of G and compactness of C, we can cover C by finitely many
open sets U_1, ..., U_n of X, containing points x_1, ..., x_n, respectively, such that for
each i, we have

d(g(x), g(x_i)) < ε/3

for x ∈ U_i and g ∈ G. Then we define B to be the basis element for Y^X defined by the
equation

B = {h | h ∈ Y^X and d(h(x_i), g(x_i)) < ε/3 for i = 1, ..., n}.

We show that if h is an element of B ∩ G, then h belongs to B_C(g, ε). That is, we show
that d(h(x), g(x)) < ε for x ∈ C. Given x ∈ C, choose i so that x ∈ U_i. Then

d(h(x), h(x_i)) < ε/3 and
d(g(x), g(x_i)) < ε/3

because x ∈ U_i and g, h ∈ G, while

d(h(x_i), g(x_i)) < ε/3

because h ∈ B. It follows from the triangle inequality that d(h(x), g(x)) < ε, as
desired.
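
The three estimates combine as follows. This display only spells out the triangle-inequality step for a point x of C lying in U_i, in the notation of Step 3:

```latex
\begin{align*}
d(h(x), g(x))
  &\le d(h(x), h(x_i)) + d(h(x_i), g(x_i)) + d(g(x_i), g(x)) \\
  &< \epsilon/3 + \epsilon/3 + \epsilon/3 = \epsilon .
\end{align*}
```

Because h and g are continuous (by Step 2) and C is compact, the supremum of d(h(x), g(x)) over C is attained, so it is also less than ε.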
Step 4. We complete the proof. The set G contains F and is contained in C(X, Y).
It is compact as a subspace of Y^X in the product topology. By the result just proved, it
is also compact as a subspace of C(X, Y) in the compact convergence topology.

Proof of (b). Let H be a compact subspace of C(X, Y) that contains F. We show
that H is equicontinuous and that H_a is compact for each a ∈ X. It follows that F is
equicontinuous (since F ⊂ H), and that F_a lies in the compact subspace H_a of Y, so
that the closure of F_a is compact.

To show H_a is compact, consider the composite of the map

j : C(X, Y) → X × C(X, Y)

defined by j(f) = a × f, and the evaluation map

e : X × C(X, Y) → Y,

given by the equation e(x × f) = f(x). The map j is obviously continuous, and the
map e is continuous by Theorems 46.8 and 46.10. The composite e ∘ j maps H to H_a;
since H is compact, so is H_a.

Now we show that H is equicontinuous at a, relative to the metric d. Let A be a
compact subspace of X that contains a neighborhood of a. It suffices to show that the
subset

ℛ = {f|A ; f ∈ H}

of C(A, Y) is equicontinuous at a.

Give C(A, Y) the compact convergence topology. We show that the restriction
map

r : C(X, Y) → C(A, Y)

is continuous. Let f be an element of C(X, Y) and let B = B_C(f|A, ε) be a basis
element for C(A, Y) containing f|A, where C is a compact subspace of A. Then C
is a compact subspace of X, and r maps the neighborhood B_C(f, ε) of f in C(X, Y)
into B.

The map r maps H onto ℛ; because H is compact, so is ℛ. Now ℛ is a subspace
of C(A, Y); because A is compact, the compact convergence and the uniform topologies
on C(A, Y) coincide. It follows from Theorem 45.1 that ℛ is totally bounded in
the uniform metric on C(A, Y); then Lemma 45.2 implies that ℛ is equicontinuous
relative to d. ∎


An even more general version of Ascoli’s theorem may be found in [K] or [Wd].
There it is not assumed that Y is a metric space, but only that it has what is called a
uniform structure, which is a generalization of the notion of metric.

Ascoli’s theorem has many applications in analysis, but these lie outside the scope 
of this book. See [K-F] for several such applications. 


Exercises 


1. Which of the following subsets of C(R, R) are pointwise bounded? Which are
equicontinuous?
(a) The collection {f_n}, where f_n(x) = x + sin nx.
(b) The collection {g_n}, where g_n(x) = n + sin x.
(c) The collection {h_n}, where h_n(x) = |x|^{1/n}.
(d) The collection {k_n}, where k_n(x) = n sin(x/n).


2. Prove the following:

Theorem. If X is a locally compact Hausdorff space, then a subspace F of
C(X, R^n) in the topology of compact convergence has compact closure if and
only if F is pointwise bounded and equicontinuous under either of the standard
metrics on R^n.

3. Show that the general version of Ascoli’s theorem implies the classical version
(Theorem 45.4) when X is Hausdorff.

4. Prove the following:

Theorem (Arzela’s theorem, general version). Let X be a Hausdorff space that
is σ-compact; let f_n be a sequence of functions f_n : X → R^k. If the collection
{f_n} is pointwise bounded and equicontinuous, then the sequence f_n has a
subsequence that converges, in the topology of compact convergence, to a
continuous function.

[Hint: Show C(X, R^k) is first-countable.]

5. Let (Y, d) be a metric space; let f_n : X → Y be a sequence of continuous
functions; let f : X → Y be a function (not necessarily continuous). Suppose f_n
converges to f in the topology of pointwise convergence. Show that if {f_n} is
equicontinuous, then f is continuous and f_n converges to f in the topology of
compact convergence.


Chapter 8 


Baire Spaces and Dimension Theory


In this chapter, we introduce a class of topological spaces called the Baire spaces. 
The defining condition for a Baire space is a bit complicated to state, but it is often 
useful in the applications, in both analysis and topology. Most of the spaces we have 
been studying are Baire spaces. For instance, a Hausdorff space is a Baire space if 
it is compact, or even locally compact. And a metrizable space X is a Baire space if 
it is topologically complete, that is, if there is a metric for X relative to which X is 
complete. 

It follows that, since the space C(X, R^n) of all continuous functions from a space X
to R^n is complete in the uniform metric, it is a Baire space in the uniform topology.
This fact has a number of interesting applications. 

One application is the proof we give in §49 of the existence of a continuous 
nowhere-differentiable real-valued function. 

Another application arises in that branch of topology called dimension theory.
In §50, we define a topological notion of dimension, due to Lebesgue. And we prove
the classical theorem that every compact metrizable space of topological dimension m
can be imbedded in euclidean space R^N of dimension N = 2m + 1. It follows that
every compact m-manifold can be imbedded in R^{2m+1}. This generalizes the imbedding
theorem proved in §36.

Throughout the chapter, we assume familiarity with complete metric spaces (§43).
When we study dimension theory, we shall make use of §36, Imbeddings of Manifolds, 
as well as a bit of linear algebra. 
§48 Baire Spaces


The defining condition for a Baire space is probably as “unnatural looking” as any 
condition we have yet introduced in this book. But bear with us awhile. 

In this section, we shall define Baire spaces and shall show that two important 
classes of spaces—the complete metric spaces and the compact Hausdorff spaces— 
are contained in the class of Baire spaces. Then we shall give some applications, 
which, even if they do not make the Baire condition seem any more natural, will at 
least show what a useful tool it can be. In fact, it turns out to be a very useful and 
fairly sophisticated tool in both analysis and topology. 


Definition. Recall that if A is a subset of a space X, the interior of A is defined as the 
union of all open sets of X that are contained in A. To say that A has empty interior is 
to say then that A contains no open set of X other than the empty set. Equivalently, A 
has empty interior if every point of A is a limit point of the complement of A, that is, 
if the complement of A is dense in X. 


EXAMPLE 1. The set Q of rationals has empty interior as a subset of R, but the interval
[0, 1] has nonempty interior. The interval $[0, 1] \times 0$ has empty interior as a subset of the
plane $R^2$, and so does the subset $Q \times R$.


Definition. A space X is said to be a Baire space if the following condition holds:
Given any countable collection $\{A_n\}$ of closed sets of X each of which has empty
interior in X, their union $\bigcup A_n$ also has empty interior in X.


EXAMPLE 2. The space Q of rationals is not a Baire space. For each one-point set in Q
is closed and has empty interior in Q; and Q is the countable union of its one-point subsets.

The space $Z_+$, on the other hand, does form a Baire space. Every subset of $Z_+$ is
open, so that there exist no subsets of $Z_+$ having empty interior, except for the empty set.
Therefore, $Z_+$ satisfies the Baire condition vacuously.

More generally, every closed subspace of R, being a complete metric space, is a Baire
space. Somewhat surprising is the fact that the irrationals in R also form a Baire space; see
Exercise 6.


The terminology originally used by R. Baire for this concept involved the word
“category.” A subset A of a space X was said to be of the first category in X if it 
was contained in the union of a countable collection of closed sets of X having empty 
interiors in X; otherwise, it was said to be of the second category in X. Using this 
terminology, we can say the following: 


A space X is a Baire space if and only if every nonempty open set in X is
of the second category.


We shall not use the terms “first category” and “second category” in this book. 

The preceding definition is the “closed set definition” of a Baire space. There 
is also a formulation involving open sets that is frequently useful. It is given in the 
following lemma. 
Lemma 48.1. X is a Baire space if and only if given any countable collection $\{U_n\}$
of open sets in X, each of which is dense in X, their intersection $\bigcap U_n$ is also dense
in X.


Proof. Recall that a set C is dense in X if $\bar{C} = X$. The lemma now follows at once
from the two remarks:

(1) A is closed in X if and only if X − A is open in X.

(2) B has empty interior in X if and only if X − B is dense in X. ∎
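
To see how the two remarks combine, it may help to write out the complement computation. The display below is only a sketch of that step; the symbols $A_n$ (a closed set with empty interior) and $U_n = X - A_n$ (its complement) are notation assumed here for illustration.

```latex
% Assumed notation for this sketch: A_n closed in X, U_n = X - A_n.
\[
  X - \bigcup_{n} A_n \;=\; \bigcap_{n} (X - A_n) \;=\; \bigcap_{n} U_n .
\]
% By remark (1), each U_n is open. By remark (2), A_n has empty interior
% exactly when U_n is dense, and \bigcup A_n has empty interior exactly
% when \bigcap U_n is dense. So the closed-set condition and the open-set
% condition are translations of each other.
```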


There are a number of theorems giving conditions under which a space is a Baire 
space. The most important is the following: 


Theorem 48.2 (Baire category theorem). If X is a compact Hausdorff space or a 
complete metric space, then X is a Baire space.


Proof. Given a countable collection $\{A_n\}$ of closed sets of X having empty interiors,
we want to show that their union $\bigcup A_n$ also has empty interior in X. So, given the
nonempty open set $U_0$ of X, we must find a point x of $U_0$ that does not lie in any of
the sets $A_n$.

Consider the first set $A_1$. By hypothesis, $A_1$ does not contain $U_0$. Therefore, we
may choose a point y of $U_0$ that is not in $A_1$. Regularity of X, along with the fact that
$A_1$ is closed, enables us to choose a neighborhood $U_1$ of y such that

$$\bar{U}_1 \cap A_1 = \emptyset,$$
$$\bar{U}_1 \subset U_0.$$

If X is metric, we also choose $U_1$ small enough that its diameter is less than 1.

In general, given the nonempty open set $U_{n-1}$, we choose a point of $U_{n-1}$ that is
not in the closed set $A_n$, and then we choose $U_n$ to be a neighborhood of this point
such that

$$\bar{U}_n \cap A_n = \emptyset,$$
$$\bar{U}_n \subset U_{n-1},$$
$$\operatorname{diam} U_n < 1/n \quad \text{in the metric case.}$$

We assert that the intersection $\bigcap \bar{U}_n$ is nonempty. From this fact, our theorem will
follow. For if x is a point of $\bigcap \bar{U}_n$, then x is in $U_0$ because $\bar{U}_1 \subset U_0$. And for each n,
the point x is not in $A_n$ because $\bar{U}_n$ is disjoint from $A_n$.

The proof that $\bigcap \bar{U}_n$ is nonempty splits into two parts, depending on whether X
is compact Hausdorff or complete metric. If X is compact Hausdorff, we consider
the nested sequence $\bar{U}_1 \supset \bar{U}_2 \supset \cdots$ of nonempty subsets of X. The collection $\{\bar{U}_n\}$
has the finite intersection property; since X is compact, the intersection $\bigcap \bar{U}_n$ must be
nonempty.

If X is complete metric, we apply the following lemma. ∎
Lemma 48.3. Let $C_1 \supset C_2 \supset \cdots$ be a nested sequence of nonempty closed sets in
the complete metric space X. If $\operatorname{diam} C_n \to 0$, then $\bigcap C_n \neq \emptyset$.


Proof. We gave this as an exercise in §43. Here is a proof: Choose $x_n \in C_n$ for each
n. Because $x_n, x_m \in C_N$ for $n, m \geq N$, and because $\operatorname{diam} C_N$ can be made less than
any given $\epsilon$ by choosing N large enough, the sequence $(x_n)$ is a Cauchy sequence.
Suppose that it converges to x. Then for given k, the subsequence $x_k, x_{k+1}, \ldots$ also
converges to x. Thus x necessarily belongs to $\bar{C}_k = C_k$. Then $x \in \bigcap C_k$, as desired. ∎
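
The nested-neighborhood construction in the proof of Theorem 48.2 can be imitated concretely in the complete metric space [0, 1], taking each closed set $A_n$ to be a single point. The following Python sketch is only an illustration under that assumption: the helper names `shrink` and `avoid` are hypothetical, closed intervals stand in for the closures of the chosen neighborhoods, and only a finite initial segment of an enumeration of the rationals is processed.

```python
from fractions import Fraction

def shrink(lo, hi, q):
    """Return a closed subinterval of [lo, hi], one third as long, that misses q."""
    third = (hi - lo) / 3
    if lo <= q <= lo + third:
        return hi - third, hi      # q lies in the left third, so take the right third
    return lo, lo + third          # otherwise the left third misses q

def avoid(points, lo=Fraction(0), hi=Fraction(1)):
    """Build nested closed intervals; the n-th one misses the first n given points."""
    intervals = []
    for q in points:
        lo, hi = shrink(lo, hi, q)
        intervals.append((lo, hi))
    return intervals

# A finite initial segment of an enumeration (with repeats) of the rationals in [0, 1].
rationals = [Fraction(p, d) for d in range(1, 7) for p in range(d + 1)]
nested = avoid(rationals)
final_lo, final_hi = nested[-1]
```

The n-th interval has length $3^{-n}$, so the diameters tend to 0 as Lemma 48.3 requires, and every point of the final interval avoids each rational processed so far.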


Here is one application of the theory of Baire spaces; we shall give further ap- 
plications in the sections that follow. This application is perhaps more amusing than 
profound. It concerns a question that a student might ask concerning convergent se- 
quences of continuous functions. 

Let $f_n : [0, 1] \to R$ be a sequence of continuous functions such that $f_n(x) \to f(x)$
for each $x \in [0, 1]$. There are examples that show the limit function f need
not be continuous. But one might wonder just how discontinuous f can be. Could it
be discontinuous everywhere, for instance? The answer is “no.” We shall show that
f must be continuous at infinitely many points of [0, 1]. In fact, the set of points at
which f is continuous is dense in [0, 1]!
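
A standard example of a discontinuous pointwise limit, not spelled out in the text, is the sequence $f_n(x) = x^n$ on [0, 1]. The Python sketch below is offered only as an illustration; the function names are hypothetical. The limit is 0 for x < 1 and 1 at x = 1, so it is continuous on the dense set [0, 1) and discontinuous only at the single point 1.

```python
def f_n(n, x):
    """The n-th function of the sequence f_n(x) = x**n on [0, 1]."""
    return x ** n

def pointwise_limit(x):
    """Pointwise limit of x**n on [0, 1]: 0 for x < 1, and 1 at x = 1."""
    return 1.0 if x == 1.0 else 0.0

def gap(n, x):
    """Distance between f_n(x) and the limit at the point x."""
    return abs(f_n(n, x) - pointwise_limit(x))

# At a fixed point the gap shrinks, but near x = 1 it stays large for every n,
# so the convergence is pointwise and not uniform.
small_gap = gap(200, 0.9)
large_gap = gap(1000, 0.999)
```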

To prove this result, we need the following lemma: 


*Lemma 48.4. Any open subspace Y of a Baire space X is itself a Baire space. 


Proof. Let $A_n$ be a countable collection of closed sets of Y that have empty interiors
in Y. We show that $\bigcup A_n$ has empty interior in Y.

Let $\bar{A}_n$ be the closure of $A_n$ in X; then $\bar{A}_n \cap Y = A_n$. The set $\bar{A}_n$ has empty
interior in X. For if U is a nonempty open set of X contained in $\bar{A}_n$, then U must
intersect $A_n$. Then $U \cap Y$ is a nonempty open set of Y contained in $A_n$, contrary to
hypothesis.

If the union of the sets $A_n$ contains the nonempty open set W of Y, then the union
of the sets $\bar{A}_n$ also contains the set W, which is open in X because Y is open in X. But
each set $\bar{A}_n$ has empty interior in X, contradicting the fact that X is a Baire space. ∎


*Theorem 48.5. Let X be a space; let (Y, d) be a metric space. Let $f_n : X \to Y$
be a sequence of continuous functions such that $f_n(x) \to f(x)$ for all $x \in X$, where
$f : X \to Y$. If X is a Baire space, the set of points at which f is continuous is dense
in X.

Proof. Given a positive integer N and given $\epsilon > 0$, define

$$A_N(\epsilon) = \{x \mid d(f_n(x), f_m(x)) \leq \epsilon \text{ for all } n, m \geq N\}.$$

Note that $A_N(\epsilon)$ is closed in X. For the set of those x for which $d(f_n(x), f_m(x)) \leq \epsilon$
is closed in X, by continuity of $f_n$ and $f_m$, and $A_N(\epsilon)$ is the intersection of these sets
for all $n, m \geq N$.
For fixed $\epsilon$, consider the sets $A_1(\epsilon) \subset A_2(\epsilon) \subset \cdots$. The union of these sets
is all of X. For, given $x_0 \in X$, the fact that $f_n(x_0) \to f(x_0)$ implies that the sequence
$f_n(x_0)$ is a Cauchy sequence; hence $x_0 \in A_N(\epsilon)$ for some N.

Now let

$$U(\epsilon) = \bigcup_{N \in Z_+} \operatorname{Int} A_N(\epsilon).$$

We shall prove two things:
(1) $U(\epsilon)$ is open and dense in X.
(2) The function f is continuous at each point of the set

$$C = U(1) \cap U(1/2) \cap U(1/3) \cap \cdots.$$


Our theorem then follows from the fact that X is a Baire space. 

To show that $U(\epsilon)$ is dense in X, it suffices to show that for any nonempty open
set V of X, there is an N such that the set $V \cap \operatorname{Int} A_N(\epsilon)$ is nonempty. For this purpose,
we note first that for each N, the set $V \cap A_N(\epsilon)$ is closed in V. Because V is a Baire
space by the preceding lemma, at least one of these sets, say $V \cap A_M(\epsilon)$, must contain a
nonempty open set W of V. Because V is open in X, the set W is open in X; therefore,
it is contained in $\operatorname{Int} A_M(\epsilon)$.

Now we show that if $x_0 \in C$, then f is continuous at $x_0$. Given $\epsilon > 0$, we shall
find a neighborhood W of $x_0$ such that $d(f(x), f(x_0)) < \epsilon$ for $x \in W$.

First, choose k so that $1/k < \epsilon/3$. Since $x_0 \in C$, we have $x_0 \in U(1/k)$; therefore,
there is an N such that $x_0 \in \operatorname{Int} A_N(1/k)$. Finally, continuity of the function $f_N$
enables us to choose a neighborhood W of $x_0$, contained in $A_N(1/k)$, such that

$$(*) \quad d(f_N(x), f_N(x_0)) < \epsilon/3 \quad \text{for } x \in W.$$

The fact that $W \subset A_N(1/k)$ implies that

$$d(f_n(x), f_N(x)) \leq 1/k \quad \text{for } n \geq N \text{ and } x \in W.$$

Letting $n \to \infty$, we obtain the inequality

$$(**) \quad d(f(x), f_N(x)) \leq 1/k < \epsilon/3 \quad \text{for } x \in W.$$

In particular, since $x_0 \in W$, we have

$$(***) \quad d(f(x_0), f_N(x_0)) < \epsilon/3.$$

Applying the triangle inequality to (*), (**), and (***) gives us our desired result. ∎
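
For readers who want the last step written out, the following display sketches how the three estimates of the proof combine; it uses only the quantities already introduced there (the index N, the neighborhood W, and the point $x_0$).

```latex
% For x in W, each of the three terms on the right is smaller than epsilon/3.
\[
  d(f(x), f(x_0))
  \;\le\; d(f(x), f_N(x)) + d(f_N(x), f_N(x_0)) + d(f_N(x_0), f(x_0))
  \;<\; \frac{\epsilon}{3} + \frac{\epsilon}{3} + \frac{\epsilon}{3}
  \;=\; \epsilon .
\]
```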


Exercises 


1. Let X equal the countable union $\bigcup B_n$. Show that if X is a nonempty Baire
space, at least one of the sets $\bar{B}_n$ has a nonempty interior.
2. The Baire category theorem implies that R cannot be written as a countable union
of closed subsets having empty interiors. Show this fails if the sets are not required
to be closed.

3. Show that every locally compact Hausdorff space is a Baire space.

4. Show that if every point x of X has a neighborhood that is a Baire space, then X
is a Baire space. [Hint: Use the open set formulation of the Baire condition.]

5. Show that if Y is a $G_\delta$ set in X, and if X is compact Hausdorff or complete
metric, then Y is a Baire space in the subspace topology. [Hint: Suppose that
$Y = \bigcap W_n$, where $W_n$ is open in X, and that $B_n$ is closed in Y and has empty
interior in Y. Given $U_0$ open in X with $U_0 \cap Y \neq \emptyset$, find a sequence of open
sets $U_n$ of X with $U_n \cap Y$ nonempty, such that

$$\bar{U}_n \subset U_{n-1},$$
$$\bar{U}_n \cap B_n = \emptyset,$$
$$\operatorname{diam} U_n < 1/n \quad \text{in the metric case,}$$
$$U_n \subset W_n.$$]


6. Show that the irrationals are a Baire space.

7. Prove the following:

Theorem. If D is a countable dense subset of R, there is no function $f : R \to R$
that is continuous precisely at the points of D.

Proof.
(a) Show that if $f : R \to R$, then the set C of points at which f is continuous
is a $G_\delta$ set in R. [Hint: Let $U_n$ be the union of all open sets U of R such that
$\operatorname{diam} f(U) < 1/n$. Show that $C = \bigcap U_n$.]

(b) Show that D is not a $G_\delta$ set in R. [Hint: Suppose $D = \bigcap W_n$, where $W_n$ is
open in R. For $d \in D$, set $V_d = R - \{d\}$. Show $W_n$ and $V_d$ are dense in R.]

8. If $f_n$ is a sequence of continuous functions $f_n : R \to R$ such that $f_n(x) \to f(x)$
for each $x \in R$, show that f is continuous at uncountably many points of R.

9. Let $g : Z_+ \to Q$ be a bijective function; let $x_n = g(n)$. Define $f : R \to R$ as
follows:

$$f(x_n) = 1/n \quad \text{for } x_n \in Q,$$
$$f(x) = 0 \quad \text{for } x \notin Q.$$

Show that f is continuous at each irrational and discontinuous at each rational.
Can you find a sequence of continuous functions $f_n$ converging to f?


10. Prove the following:
Theorem (Uniform boundedness principle). Let X be a complete metric space,
and let $\mathcal{F}$ be a subset of C(X, R) such that for each $a \in X$, the set

$$\mathcal{F}_a = \{f(a) \mid f \in \mathcal{F}\}$$

is bounded. Then there is a nonempty open set U of X on which the functions
in $\mathcal{F}$ are uniformly bounded, that is, there is a number M such that $|f(x)| \leq M$
for all $x \in U$ and all $f \in \mathcal{F}$. [Hint: Let $A_N = \{x ; |f(x)| \leq N \text{ for all } f \in \mathcal{F}\}$.]

11. Determine whether or not $R_\ell$ is a Baire space.

12. Show that $R^J$ is a Baire space in the box, product, and uniform topologies.

*13. Let X be a topological space; let Y be a complete metric space. Show that
C(X, Y) is a Baire space in the fine topology (see Exercise 11 of §46). [Hint:
Given basis elements $B(f_i, \delta_i)$ such that $\delta_1 \leq 1$ and $\delta_{i+1} \leq \delta_i/3$ and
$f_{i+1} \in B(f_i, \delta_i/3)$, show that

$$\bigcap B(f_i, \delta_i) \neq \emptyset.$$]


*§49 A Nowhere-Differentiable Function 


We prove the following result from analysis: 


Theorem 49.1. Let $h : [0, 1] \to R$ be a continuous function. Given $\epsilon > 0$, there is
a function $g : [0, 1] \to R$ with $|h(x) - g(x)| < \epsilon$ for all x, such that g is continuous
and nowhere differentiable.


Proof. Let I = [0, 1]. Consider the space $\mathcal{C} = C(I, R)$ of continuous maps from I
to R, in the metric

$$\rho(f, g) = \max\{|f(x) - g(x)|\}.$$

This space is a complete metric space and, therefore, is a Baire space. We shall define,
for each n, a certain subset $U_n$ of $\mathcal{C}$ that is open in $\mathcal{C}$ and dense in $\mathcal{C}$, and has the
property that the functions belonging to the intersection

$$\bigcap_{n \in Z_+} U_n$$

are nowhere differentiable. Because $\mathcal{C}$ is a Baire space, this intersection is dense in $\mathcal{C}$,
by Lemma 48.1. Therefore, given h and $\epsilon$, this intersection must contain a function g
such that $\rho(h, g) < \epsilon$. The theorem follows.

The tricky part is to define the set $U_n$ properly. We first take a function f and
consider its difference quotients. Given $x \in I$ and given $0 < h \leq 1/2$, consider the
expressions

$$\left| \frac{f(x+h) - f(x)}{h} \right| \quad \text{and} \quad \left| \frac{f(x-h) - f(x)}{-h} \right|.$$

Since $h \leq 1/2$, at least one of the numbers x + h and x − h belongs to I, so that at least
one of these expressions is defined. Let $\Delta f(x, h)$ denote the larger of the two if both
are defined; otherwise, let it denote the one that is defined. If the derivative f'(x) of f
at x exists, it equals the limit of these difference quotients, so that

$$|f'(x)| = \lim_{h \to 0} \Delta f(x, h).$$

We seek to find a continuous function for which this limit does not exist. To be specific,
we shall construct f so that given x, there is a sequence of numbers $h_n$ converging to 0
for which the numbers $\Delta f(x, h_n)$ become arbitrarily large.

This gives us the idea for defining the set $U_n$. Given any positive number $h \leq 1/2$,
let

$$\Delta_h f = \inf\{\Delta f(x, h) \mid x \in I\}.$$

Then for $n \geq 2$, we define $U_n$ by declaring that a function f belongs to $U_n$ if and only
if for some positive number $h \leq 1/n$, we have $\Delta_h f > n$.


EXAMPLE 1. Let $\alpha > 0$ be given. The function $f : [0, 1] \to R$ given by the equation
$f(x) = 4\alpha x(1 - x)$, whose graph is a parabola, satisfies the condition $\Delta f(x, h) \geq \alpha$ for
h = 1/4 and all x, as you can check. Geometrically speaking, what this says is that for
each x, at least one of the indicated secant lines of the parabola in Figure 49.1 has slope of
absolute value at least $\alpha$. Hence if $\alpha > 4$, the function f belongs to $U_4$. The function g
pictured in Figure 49.1 satisfies the condition $\Delta g(x, h) \geq \alpha$ for any $h \leq 1/4$; hence g
belongs to $U_n$ provided $\alpha > n$. The function k satisfies the condition $\Delta k(x, h) \geq \alpha$ for any
$h \leq 1/8$; hence k belongs to $U_n$ if $\alpha > n$.
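
The claim about the parabola can be spot-checked numerically. The Python sketch below is only a sanity check, not a proof: it evaluates the larger defined difference quotient on a finite grid of points of [0, 1], so the minimum it reports merely approximates the infimum over the whole interval. The function names and the sample value of alpha are hypothetical choices made for illustration.

```python
def larger_quotient(f, x, h):
    """The larger of the forward and backward difference quotients (in absolute
    value) at x with step h, using only those that are defined on [0, 1]."""
    quotients = []
    if x + h <= 1.0:
        quotients.append(abs((f(x + h) - f(x)) / h))
    if x - h >= 0.0:
        quotients.append(abs((f(x - h) - f(x)) / (-h)))
    return max(quotients)

def grid_infimum(f, h, samples=1000):
    """Minimum of larger_quotient over the grid points i / samples in [0, 1]."""
    return min(larger_quotient(f, i / samples, h) for i in range(samples + 1))

alpha = 5.0                                   # sample value with alpha > 4

def parabola(x):
    return 4 * alpha * x * (1 - x)

approx_inf = grid_infimum(parabola, 0.25)
```

On this grid the minimum is attained at x = 1/2, where both quotients equal alpha; since alpha > 4 and h = 1/4, this is consistent with the parabola belonging to the set indexed by n = 4.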


Figure 49.1


Now we prove the following facts about the set $U_n$:

(1) $\bigcap U_n$ consists of nowhere-differentiable functions. Let $f \in \bigcap U_n$. We shall
prove that given x in [0, 1], the limit

$$\lim_{h \to 0} \Delta f(x, h)$$

does not exist: Given n, the fact that f belongs to $U_n$ means that we can find a number
$h_n$ with $0 < h_n \leq 1/n$ such that

$$\Delta f(x, h_n) > n.$$

Then the sequence $(h_n)$ converges to zero, but the sequence $(\Delta f(x, h_n))$ does not
converge. As a result, f is not differentiable at x.


(2) $U_n$ is open in $\mathcal{C}$. Suppose that $f \in U_n$; we find a $\delta$-neighborhood of f that is
contained in $U_n$. Because $f \in U_n$, there is a number h with $0 < h \leq 1/n$ such that
$\Delta_h f > n$. Set $M = \Delta_h f$, and let

$$\delta = h(M - n)/4.$$

We assert that if g is a function with $\rho(f, g) < \delta$, then

$$\Delta g(x, h) \geq \tfrac{1}{2}(M + n) > n$$

for all $x \in I$, so that $g \in U_n$.

To prove the assertion, let us first assume that $\Delta f(x, h)$ is equal to the quotient
$|f(x + h) - f(x)|/h$. We compute

$$\left| \frac{f(x+h) - f(x)}{h} - \frac{g(x+h) - g(x)}{h} \right| = \frac{1}{h} \left| [f(x+h) - g(x+h)] - [f(x) - g(x)] \right| \leq 2\delta/h = (M - n)/2.$$

If the first difference quotient is at least M in absolute value, then the second is in
absolute value at least

$$M - \tfrac{1}{2}(M - n) = \tfrac{1}{2}(M + n).$$

A similar remark applies if $\Delta f(x, h)$ equals the other difference quotient.


(3) $U_n$ is dense in $\mathcal{C}$. We must show that given f in $\mathcal{C}$, given $\epsilon > 0$, and given n,
we can find an element g of $U_n$ within $\epsilon$ of f.

Choose $\alpha > n$. We shall construct g as a “piecewise-linear” function, that is, a
function whose graph is a broken line segment; each line segment in the graph of g
will have slope at least $\alpha$ in absolute value. It follows at once that such a function g
belongs to $U_n$. For let

$$0 = x_0 < x_1 < x_2 < \cdots < x_k = 1$$

be a partition of the interval [0, 1] such that the restriction of g to each subinterval
$I_i = [x_{i-1}, x_i]$ is a linear function. Then choose h so that $h \leq 1/n$ and

$$h \leq \tfrac{1}{2} \min\{|x_i - x_{i-1}| ; i = 1, \ldots, k\}.$$

If x is in [0, 1], then x belongs to some subinterval $I_i$. If x belongs to the first half of
the subinterval $I_i$, then x + h belongs to $I_i$ and $(g(x + h) - g(x))/h$ equals the slope
of the linear function $g|I_i$. Similarly, if x belongs to the second half of $I_i$, then x − h
belongs to $I_i$ and $(g(x - h) - g(x))/(-h)$ equals the slope of $g|I_i$. In either case,
$\Delta g(x, h) \geq \alpha$, so $g \in U_n$, as desired.
Now given f, $\epsilon$, and $\alpha$, we must show how to construct the desired piecewise-linear
function g. First, we use uniform continuity of f to choose a partition of the interval

$$0 = t_0 < t_1 < \cdots < t_m = 1$$

having the property that f varies by at most $\epsilon/4$ on each subinterval $[t_{i-1}, t_i]$ of this
partition. For each i = 1, ..., m, choose a point $a_i \in (t_{i-1}, t_i)$. We then define a
piecewise-linear function $g_1$ by the equations

$$g_1(x) = \begin{cases} f(t_{i-1}) & \text{for } x \in [t_{i-1}, a_i], \\ f(t_{i-1}) + m_i(x - a_i) & \text{for } x \in [a_i, t_i], \end{cases}$$

where $m_i = (f(t_i) - f(t_{i-1}))/(t_i - a_i)$. The graphs of f and $g_1$ are pictured in
Figure 49.2.


Figure 49.2


We have some freedom of choice in choosing the point $a_i$. If $f(t_i) \neq f(t_{i-1})$, we
require $a_i$ to be close enough to $t_i$ that

$$\left| \frac{f(t_i) - f(t_{i-1})}{t_i - a_i} \right| > \alpha.$$

Then the graph of $g_1$ will consist entirely of line segments of slope zero and line
segments of slope at least $\alpha$ in absolute value.

Furthermore, we assert that $\rho(g_1, f) \leq \epsilon/2$. On the interval $[t_{i-1}, t_i]$, both $g_1(x)$
and f(x) vary by at most $\epsilon/4$ from $f(t_{i-1})$; therefore, they are within $\epsilon/2$ of each
other. Then $\rho(g_1, f) = \max\{|g_1(x) - f(x)|\} \leq \epsilon/2$.
Figure 49.3


The function $g_1$ is not yet the function we want. We now define a function g by
replacing each horizontal line segment in the graph of $g_1$ by a “sawtooth” graph that
lies within $\epsilon/2$ of the graph of $g_1$ and has the property that each edge of the sawtooth
has slope at least $\alpha$ in absolute value. We leave this part of the construction to you.
The result is the desired piecewise-linear function g. See Figure 49.3. ∎
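
The sawtooth step is left to the reader; one possible way to carry it out on a single horizontal segment is sketched below in Python. This is an assumption-laden illustration rather than the construction intended by the text: the function name `sawtooth` and its parameters are hypothetical, the segment is taken to be [a, b] at constant height c, and every edge is given slope exactly alpha or −alpha.

```python
import math

def sawtooth(x, a, b, c, alpha, eps):
    """Value at x of a sawtooth replacing the horizontal segment [a, b] at height c.

    The number of teeth is chosen so that the height alpha * w of each tooth is
    at most eps / 2, where w is the half-width of a tooth. The graph starts and
    ends at height c, so it joins continuously to the neighboring pieces.
    """
    teeth = max(1, math.ceil(alpha * (b - a) / eps))
    w = (b - a) / (2 * teeth)          # half-width of one tooth
    t = (x - a) % (2 * w)              # position inside the current tooth
    return c + alpha * min(t, 2 * w - t)

# Sample parameters (hypothetical): segment [0, 1] at height 2, alpha = 10, eps = 0.5.
values = [sawtooth(i / 1000, 0.0, 1.0, 2.0, 10.0, 0.5) for i in range(1001)]
```

With these sample parameters there are 20 teeth of height 0.25, so the sawtooth stays within eps/2 of the original segment while every edge has slope of absolute value alpha.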


You may find this proof frustrating, in that it seems so abstract and nonconstructive.
Implicit in the proof, however, is a procedure for constructing a specific
sequence $f_n$ of piecewise-linear functions that converges uniformly to the nowhere-differentiable
function f. And defining the function f in this way is just as constructive
as the usual definition of the sine function, for instance, as the limit of an infinite
series.


Exercises 


1. Check the stated properties of the functions f, g, and k of Example 1.


2. Given n and $\epsilon$, define a continuous function $f : I \to R$ such that $f \in U_n$ and
$|f(x)| < \epsilon$ for all x.
We showed in §36 that if X is a compact manifold, then X can be imbedded in $R^N$
for some positive integer N. In this section, we generalize this theorem to arbitrary
compact metrizable spaces.

We shall define, for an arbitrary topological space X, a notion of topological dimension.
It is the “covering dimension” originally defined by Lebesgue. We shall
prove that each compact subset of $R^m$ has topological dimension at most m. We shall
also prove that the topological dimension of any compact m-manifold is at most m. (It
is, in fact, precisely m, but this we shall not prove.)

The major theorem of this section is the theorem, due to K. Menger and G. Nöbeling,
that any compact metrizable space of topological dimension m can be imbedded
in $R^N$ for N = 2m + 1. The proof is an application of the Baire theorem. It follows
that every compact m-manifold can be imbedded in $R^{2m+1}$. It follows also that a
compact metrizable space can be imbedded in $R^N$ for some N if and only if it has
finite topological dimension.

Much of what we shall do holds without requiring the space in question to be 
compact. But we shall restrict ourselves to that case whenever it is convenient to do 
so. Generalizations to the noncompact case are given in the exercises. 


Definition. A collection $\mathcal{A}$ of subsets of the space X is said to have order m + 1 if
some point of X lies in m + 1 elements of $\mathcal{A}$, and no point of X lies in more than m + 1
elements of $\mathcal{A}$.


Now we define what we mean by the topological dimension of a space X. Recall
that given a collection $\mathcal{A}$ of subsets of X, a collection $\mathcal{B}$ is said to refine $\mathcal{A}$, or to be
a refinement of $\mathcal{A}$, if for each element B of $\mathcal{B}$ there is an element A of $\mathcal{A}$ such that
$B \subset A$.


Definition. A space X is said to be finite dimensional if there is some integer m such
that for every open covering $\mathcal{A}$ of X, there is an open covering $\mathcal{B}$ of X that refines $\mathcal{A}$
and has order at most m + 1. The topological dimension of X is defined to be the
smallest value of m for which this statement holds; we denote it by dim X.


EXAMPLE 1. Any compact subspace X of R has topological dimension at most 1. We
begin by defining an open covering of R of order 2. Let $\mathcal{A}_1$ denote the collection of all open
intervals of the form (n, n + 1) in R, where n is an integer. Let $\mathcal{A}_0$ denote the collection of
all open intervals of the form (n − 1/2, n + 1/2), for n an integer. Then $\mathcal{A} = \mathcal{A}_0 \cup \mathcal{A}_1$ is
an open covering of R by sets of diameter one. Because no two elements of $\mathcal{A}_0$ intersect,
and no two elements of $\mathcal{A}_1$ intersect, $\mathcal{A}$ has order 2.

Now let X be a compact subspace of R. Given a covering $\mathcal{C}$ of X by sets open in X,
this covering has a positive Lebesgue number $\delta$. This means that any collection of subsets
of X that have diameter less than $\delta$ is automatically a refinement of $\mathcal{C}$. Consider the
homeomorphism $f : R \to R$ defined by $f(x) = (\tfrac{1}{2}\delta)x$. The images under f of the elements of
the collection $\mathcal{A}$ form an open covering of R of order 2 whose elements have diameter $\tfrac{1}{2}\delta$;
their intersections with X form the required open covering of X.
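
The order-2 covering of R in Example 1, and its rescaling by the factor δ/2, can be checked on sample points. The Python sketch below is a hedged illustration: the function name, the tags "A0" and "A1", and the sample value of delta are all hypothetical, and a finite grid of sample points can only suggest, not prove, the stated order.

```python
import math

def elements_containing(x, scale=1.0):
    """List the elements of the scaled covering that contain x.

    The unscaled covering consists of the intervals (n, n + 1), tagged "A1",
    and (n - 1/2, n + 1/2), tagged "A0", for integers n; `scale` plays the
    role of the factor delta/2 in the homeomorphism f(x) = (delta/2) x.
    """
    y = x / scale                      # pull x back to the unscaled covering
    n = math.floor(y)
    hits = []
    for k in (n - 1, n, n + 1):
        if k < y < k + 1:
            hits.append(("A1", k))
        if k - 0.5 < y < k + 0.5:
            hits.append(("A0", k))
    return hits

delta = 0.2                            # hypothetical Lebesgue number
samples = [i / 100 for i in range(-300, 301)]
counts = [len(elements_containing(x, scale=delta / 2)) for x in samples]
```

Every sample point lies in at least one and at most two elements, and each scaled interval has length δ/2, which is less than δ.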


EXAMPLE 2. The interval X = [0, 1] has topological dimension 1. We know that
$\dim X \leq 1$. To show equality holds, let $\mathcal{A}$ be the covering of X by the sets [0, 1) and (0, 1].
We show that if $\mathcal{B}$ is any open covering of X that refines $\mathcal{A}$, then $\mathcal{B}$ has order at least 2.
Since $\mathcal{B}$ refines $\mathcal{A}$, it must contain more than one element. Let U be one of the elements
of $\mathcal{B}$ and let V be the union of the others. If $\mathcal{B}$ had order 1, then the sets U and V would
be disjoint and would thus form a separation of X. We conclude that $\mathcal{B}$ has order at least 2.


EXAMPLE 3. Any compact subspace X of $R^2$ has topological dimension at most 2. To
prove this fact, we construct a certain open covering $\mathcal{A}$ of $R^2$ that has order 3. We begin by
defining $\mathcal{A}_2$ to be the collection of all open unit squares in $R^2$ of the following form:

$$\mathcal{A}_2 = \{(n, n+1) \times (m, m+1) \mid n, m \text{ integers}\}.$$

Note that the elements of $\mathcal{A}_2$ are disjoint. Then, we define a collection $\mathcal{A}_1$ by taking each
(open) edge e of one of these squares,

$$e = \{n\} \times (m, m+1) \quad \text{or} \quad e = (n, n+1) \times \{m\},$$

and expanding it slightly to an open set $U_e$ of $R^2$, being careful to ensure that if $e \neq e'$,
the sets $U_e$ and $U_{e'}$ are disjoint. We also choose each $U_e$ so that its diameter is at most 2.
Finally, we define $\mathcal{A}_0$ to be the collection consisting of all open balls of radius 1/2 about the
points $n \times m$. See Figure 50.1.

The collection of open sets $\mathcal{A} = \mathcal{A}_2 \cup \mathcal{A}_1 \cup \mathcal{A}_0$ covers $R^2$. Each of its elements has
diameter at most 2. And it has order 3, since no point of $R^2$ can lie in more than one set
from each $\mathcal{A}_i$.


Figure 50.1 


Now let X be a compact subspace of $R^2$. Given an open covering of X, it has a
positive Lebesgue number $\delta$. Consider the homeomorphism $f : R^2 \to R^2$ defined by the
equation $f(x) = (\delta/3)x$. The images under f of the open sets of the collection $\mathcal{A}$ form
an open covering of $R^2$ by sets of diameter less than $\delta$; their intersections with X form the
required open covering of X.

We shall generalize this result to compact subsets of $R^m$ shortly.


Some basic facts about topological dimension are given in the following theorems: 


Theorem 50.1. Let X be a space having finite dimension. If Y is a closed subspace
of X, then Y has finite dimension and $\dim Y \leq \dim X$.

Proof. Let dim X = m. Let $\mathcal{A}$ be a covering of Y by sets open in Y. For each
$A \in \mathcal{A}$, choose an open set A' of X such that $A' \cap Y = A$. Cover X by the open
sets A', along with the open set X − Y. Let $\mathcal{B}$ be a refinement of this covering that is
an open covering of X and has order at most m + 1. Then the collection

$$\{B \cap Y \mid B \in \mathcal{B}\}$$

is a covering of Y by sets open in Y; it has order at most m + 1, and it refines $\mathcal{A}$. ∎
Theorem 50.2. Let $X = Y \cup Z$, where Y and Z are closed subspaces of X having
finite topological dimension. Then 


dim X = max{dim Y, dim Z}. 


Proof. Let m = max{dim Y, dim Z}. We shall show that X is finite dimensional and 
has topological dimension at most m. It then follows from the preceding theorem that 
X has topological dimension precisely m. 


Step 1. If $\mathcal{A}$ is an open covering of X, we say that $\mathcal{A}$ has order at most m + 1 at
points of Y provided no point of Y lies in more than m + 1 elements of $\mathcal{A}$.

We show that if $\mathcal{A}$ is an open covering of X, then there is an open covering of X
that refines $\mathcal{A}$ and has order at most m + 1 at points of Y.

To prove this fact, consider the collection

$$\{A \cap Y \mid A \in \mathcal{A}\}.$$

It is an open covering of Y, so it has a refinement $\mathcal{B}$ that is an open covering of Y
and has order at most m + 1. Given $B \in \mathcal{B}$, choose an open set $U_B$ of X such that
$U_B \cap Y = B$. Choose also an element $A_B$ of $\mathcal{A}$ such that $B \subset A_B$. Let $\mathcal{C}$ be the
collection consisting of all the sets $U_B \cap A_B$, for $B \in \mathcal{B}$, along with all the sets A − Y,
for $A \in \mathcal{A}$. Then $\mathcal{C}$ is the desired open covering of X.


Step 2. Now let $\mathcal{A}$ be an open covering of X. We construct an open covering $\mathcal{D}$
of X that refines $\mathcal{A}$ and has order at most m + 1. Let $\mathcal{B}$ be an open covering of X
refining $\mathcal{A}$ that has order at most m + 1 at points of Y. Then let $\mathcal{C}$ be an open covering
of X refining $\mathcal{B}$ that has order at most m + 1 at points of Z.

We form a new covering $\mathcal{D}$ of X as follows: Define $f : \mathcal{C} \to \mathcal{B}$ by choosing for
each $C \in \mathcal{C}$ an element f(C) of $\mathcal{B}$ such that $C \subset f(C)$. Given $B \in \mathcal{B}$, define D(B)
to be the union of all those elements C of $\mathcal{C}$ for which f(C) = B. (Of course, D(B)
is empty if B is not in the image of f.) Let $\mathcal{D}$ be the collection of all the sets D(B),
for $B \in \mathcal{B}$.

Now $\mathcal{D}$ refines $\mathcal{B}$, because $D(B) \subset B$ for each B; therefore, $\mathcal{D}$ refines $\mathcal{A}$. Also,
$\mathcal{D}$ covers X because $\mathcal{C}$ covers X and $C \subset D(f(C))$ for each $C \in \mathcal{C}$. We show that
$\mathcal{D}$ has order at most m + 1. Suppose $x \in D(B_1) \cap \cdots \cap D(B_k)$, where the sets $D(B_i)$
are distinct. We wish to prove that $k \leq m + 1$. Note that the sets $B_1, \ldots, B_k$ must be
distinct because the sets $D(B_i)$ are. Because $x \in D(B_i)$, we can choose for each i, a
set $C_i \in \mathcal{C}$ such that $x \in C_i$ and $f(C_i) = B_i$. The sets $C_i$ are distinct because the sets
$B_i$ are. Furthermore,

$$x \in [C_1 \cap \cdots \cap C_k] \subset [D(B_1) \cap \cdots \cap D(B_k)] \subset [B_1 \cap \cdots \cap B_k].$$

If x happens to lie in Y, then $k \leq m + 1$ because $\mathcal{B}$ has order at most m + 1 at points
of Y; and if x is in Z, then $k \leq m + 1$ because $\mathcal{C}$ has order at most m + 1 at points
of Z. ∎
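
The merging construction of Step 2 is purely set-theoretic, so it can be tried out on finite sets. The Python sketch below is a combinatorial stand-in only: it ignores the topology (openness plays no role), the function names `order` and `merge_refinement` are hypothetical, and the sample collections are invented for illustration.

```python
def order(collection, points):
    """Largest number of sets in the collection that contain a single point."""
    return max(sum(1 for s in collection if p in s) for p in points)

def merge_refinement(C, B):
    """Return the nonempty sets D(B): the union of all C with f(C) = B.

    f(C) is chosen as the first element of B containing C; StopIteration is
    raised if some C lies in no element of B, that is, if C does not refine B.
    """
    merged = {}
    for c in C:
        b = next(b for b in B if c <= b)            # the chosen f(C)
        merged[b] = merged.get(b, frozenset()) | c
    return list(merged.values())

points = range(6)
B = [frozenset({0, 1, 2, 3}), frozenset({2, 3, 4, 5})]
C = [frozenset({0, 1}), frozenset({1, 2}), frozenset({2, 3}),
     frozenset({3, 4}), frozenset({4, 5})]
D = merge_refinement(C, B)
```

In this example D still covers all six points, each D(B) lies inside the corresponding B, and the order of D does not exceed the order of C, which mirrors the counting argument in the proof.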
Corollary 50.3. Let $X = Y_1 \cup \cdots \cup Y_k$, where each $Y_i$ is a closed subspace of X and
is finite dimensional. Then


$$\dim X = \max\{\dim Y_1, \ldots, \dim Y_k\}.$$


EXAMPLE 4. Every compact 1-manifold X has topological dimension 1. The space X
can be written as a finite union of spaces that are homeomorphic to the unit interval [0, 1];
then the preceding corollary applies.


EXAMPLE 5. Every compact 2-manifold X has topological dimension at most 2. The
space X can be written as a finite union of spaces that are homeomorphic to the closed unit
ball in $R^2$; then the preceding corollary applies.

An obvious question occurs at this point: Does a compact 2-manifold have topological
dimension precisely 2? The answer is “yes,” but the proof is not easy; it requires the tools of
algebraic topology. We will prove in Part II of this book that every closed triangular region
in $R^2$ has topological dimension at least 2. (See §55.) It then follows that any compact
subspace of $R^2$ that contains a closed triangular region has topological dimension 2, from
which it follows that every compact 2-manifold has topological dimension 2.


EXAMPLE 6. An arc A is a space homeomorphic to the closed unit interval; the end
points of A are the points p and q such that A − {p} and A − {q} are connected. A (finite)
linear graph G is a Hausdorff space that is written as the union of finitely many arcs, each
pair of which intersect in at most a common end point. The arcs in the collection are called
the edges of G, and the end points of the arcs are called the vertices of G. Each edge
of G, being compact, is closed in G; the preceding corollary tells us that G has topological
dimension 1.

Two particular linear graphs are sketched in Figure 50.2. The first is a diagram of
the familiar “gas-water-electricity problem”; the second is called the “complete graph on
five vertices.” Neither of them can be imbedded in $R^2$. Although this fact is “intuitively
obvious,” it is highly nontrivial to prove. We shall give a proof in §64.


Figure 50.2 


EXAMPLE 7. Every finite linear graph can be imbedded in R?. The proof involves the 
notion of “general position.” A set S of points of R? is said to be in general position if no 
three of the points of S are collinear and no four of them are coplanar. It is easy to find 
such a set of points. For example, the points of the curve 


S={(t,0?,°)] eR} 
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are in general position For if four of these points belonged to a single plane Ax + By + 
Cz = D, then the polynomial equation 


At+ B? +C =D 


would have four distinct real roots! And if three of these points belonged to a single line, 
we could take an additional point of $ and obtain four points that lie on a plane. 

Now, given a finite linear graph G, with vertices $v_1, \dots, v_n$, let us choose a set
$\{z_1, \dots, z_n\}$ of points of $R^3$ that is in general position. Define a map $f : G \to R^3$ by
letting f map the vertex $v_i$ to the point $z_i$, and map the edge joining $v_i$ and $v_j$
homeomorphically onto the line segment joining $z_i$ and $z_j$. Now each edge of G is closed in G.
It follows that f is continuous, by the pasting lemma. We show that f is injective, from
which it follows that f is an imbedding. Let $e = v_i v_j$ and $e' = v_k v_m$ be two edges of
G. If they have no vertex in common, then the line segments f(e) and f(e') are disjoint,
for otherwise the points $z_i, z_j, z_k, z_m$ would be coplanar. And if e and e' have a vertex in
common, so that i = k, say, then the line segments f(e) and f(e') intersect only in the
point $z_i = z_k$, for otherwise $z_i$, $z_j$, and $z_m$ would be collinear.
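The general-position claim for the curve S can be checked on a small sample. The following is a minimal sketch, not part of the proof: the helper names and the choice of parameter values t = 0, 1, 2, 3, 4 (one point for each vertex of the complete graph on five vertices) are assumptions made for illustration. Integer arithmetic keeps every test exact.

```python
from itertools import combinations


def moment_point(t):
    """Point (t, t^2, t^3) on the curve S of Example 7."""
    return (t, t * t, t ** 3)


def sub(p, q):
    return tuple(a - b for a, b in zip(p, q))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def collinear(p, q, r):
    # p, q, r are collinear exactly when (q - p) x (r - p) = 0.
    return cross(sub(q, p), sub(r, p)) == (0, 0, 0)


def coplanar(p, q, r, s):
    # Four points are coplanar exactly when the triple product vanishes.
    return dot(sub(q, p), cross(sub(r, p), sub(s, p))) == 0


def in_general_position(points):
    no_three = not any(collinear(*t) for t in combinations(points, 3))
    no_four = not any(coplanar(*q) for q in combinations(points, 4))
    return no_three and no_four


# Five sample parameter values (an arbitrary choice), one per vertex.
z = [moment_point(t) for t in range(5)]
print(in_general_position(z))  # prints True
```

Any other choice of distinct parameter values would serve equally well; the argument in the text shows that the test can never fail for points of S.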


Now we prove our general imbedding theorem, to the effect that every compact
metrizable space of topological dimension m can be imbedded in $R^{2m+1}$. This theorem
is another “deep” theorem; it is not at all obvious, for instance, why 2m + 1 should be
the crucial dimension. That will come out in the course of the proof.

To prove the imbedding theorem, we shall need to generalize the notion of general
position to $R^N$. This involves a bit of the analytic geometry of $R^N$, which is nothing
more than the usual linear algebra of $R^N$ translated into somewhat different language.


Definition. A set $\{x_0, \dots, x_k\}$ of points of $R^N$ is said to be geometrically independent,
or affinely independent, if the equations

$$\sum_{i=0}^{k} a_i x_i = 0 \quad\text{and}\quad \sum_{i=0}^{k} a_i = 0$$

hold only if each $a_i = 0$.


Obviously, a set consisting of only one point is geometrically independent. But
what does geometric independence mean in general? If we solve the second equation
for $a_0$ and plug the answer into the first equation, we see that this definition is
equivalent to the statement that the equation

$$\sum_{i=1}^{k} a_i (x_i - x_0) = 0$$

holds only if each $a_i = 0$. This is just the definition of linear independence for the set
of vectors $x_1 - x_0, \dots, x_k - x_0$ of the vector space $R^N$. This gives us something to
visualize: Any two distinct points form a geometrically independent set. Three points
form a geometrically independent set if they are not collinear. Four points in $R^3$ form
a geometrically independent set if they are not coplanar. And so on.

It follows from these remarks that the points

$$\begin{aligned} 0 &= (0, 0, \dots, 0), \\ e_1 &= (1, 0, \dots, 0), \\ &\;\;\vdots \\ e_N &= (0, 0, \dots, 1) \end{aligned}$$

are geometrically independent in $R^N$. It also follows that any geometrically independent
set of points in $R^N$ contains no more than N + 1 points.
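The reduction to linear independence suggests a direct computational test: form the difference vectors $x_i - x_0$ and compare their rank with their number. The sketch below assumes points with integer or rational coordinates, so that elimination is exact; the function names are illustrative only.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a list of vectors, by exact Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    for c in range(len(m[0]) if m else 0):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def geometrically_independent(points):
    """True if the vectors x_1 - x_0, ..., x_k - x_0 are linearly independent."""
    if len(points) <= 1:
        return True
    x0 = points[0]
    diffs = [[a - b for a, b in zip(x, x0)] for x in points[1:]]
    return rank(diffs) == len(diffs)


origin_and_basis = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)]
print(geometrically_independent(origin_and_basis))                # True
print(geometrically_independent(origin_and_basis + [(1, 1, 1)]))  # False
```

The second call illustrates the closing remark: five points of $R^3$ exceed the bound N + 1 = 4, so they cannot be geometrically independent.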


Definition. Let $\{x_0, \dots, x_k\}$ be a set of points of $R^N$ that is geometrically independent.
The plane P determined by these points is defined to be the set of all points x
of $R^N$ such that

$$x = \sum_{i=0}^{k} t_i x_i, \quad\text{where}\quad \sum_{i=0}^{k} t_i = 1.$$


It is simple algebra to check that P can also be expressed as the set of all points x
such that

$$(*)\qquad x = x_0 + \sum_{i=1}^{k} a_i (x_i - x_0)$$

for some scalars $a_1, \dots, a_k$. Thus P can be described not only as “the plane determined
by the points $x_0, \dots, x_k$,” but also as “the plane passing through the point $x_0$ parallel
to the vectors $x_1 - x_0, \dots, x_k - x_0$.”

Consider now the homeomorphism $T : R^N \to R^N$ defined by the equation
$T(x) = x - x_0$. It is called a translation of $R^N$. Expression (*) shows that this
map carries the plane P onto the vector subspace $V^k$ of $R^N$ having as basis the vectors
$x_1 - x_0, \dots, x_k - x_0$. For this reason, we often call P a k-plane in $R^N$.

Two facts follow at once: First, if k < N, the k-plane P necessarily has empty
interior in $R^N$ (because $V^k$ does). And second, if y is any point of $R^N$ not lying in P,
then the set

$$\{x_0, \dots, x_k, y\}$$

is geometrically independent. For if $y \notin P$, then $T(y) = y - x_0$ is not in $V^k$. By
a standard theorem of linear algebra, the vectors $\{x_1 - x_0, \dots, x_k - x_0, y - x_0\}$ are
linearly independent, from which our result follows.


Definition. A set A of points of $R^N$ is said to be in general position in $R^N$ if every
subset of A containing N + 1 or fewer points is geometrically independent.


In the case of $R^3$, this is the same as the definition given earlier, as you can check.

Lemma 50.4. Given a finite set $\{x_1, \dots, x_n\}$ of points of $R^N$ and given $\delta > 0$,
there exists a set $\{y_1, \dots, y_n\}$ of points of $R^N$ in general position in $R^N$, such that
$|x_i - y_i| < \delta$ for all i.


Proof. We proceed by induction. Set $y_1 = x_1$. Suppose that we are given $y_1, \dots, y_p$
in general position in $R^N$. Consider the set of all planes in $R^N$ determined by subsets
of $\{y_1, \dots, y_p\}$ that contain N or fewer elements. Every such subset is geometrically
independent and determines a k-plane of $R^N$ for some $k \le N - 1$. Each of these planes
has empty interior in $R^N$. Because there are only finitely many of them, their union
also has empty interior in $R^N$. (Recall that $R^N$ is a Baire space.) Choose $y_{p+1}$ to be
a point of $R^N$ within $\delta$ of $x_{p+1}$ that does not lie in any of these planes. It follows at
once that the set

$$C = \{y_1, \dots, y_p, y_{p+1}\}$$

is in general position in $R^N$. For let D be any subset of C containing N + 1 or fewer
elements. If D does not contain $y_{p+1}$, then D is geometrically independent by the
induction hypothesis. If D does contain $y_{p+1}$, then $D - \{y_{p+1}\}$ contains N or fewer
points and $y_{p+1}$ is not in the plane determined by these points, by construction. Then
as noted above, D is geometrically independent. ∎
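The induction in this proof is constructive in spirit, and it can be imitated on a small example. The sketch below is an illustration under stated assumptions rather than the argument of the text: where the proof invokes the Baire property to find a point off finitely many planes, the code simply tries random rational offsets until the general-position test succeeds, which is adequate for small inputs. The function names, the offset grid, and the sample data are all hypothetical.

```python
import random
from fractions import Fraction
from itertools import combinations


def rank(rows):
    """Rank of a list of vectors, by exact Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    for c in range(len(m[0]) if m else 0):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def geometrically_independent(points):
    if len(points) <= 1:
        return True
    x0 = points[0]
    diffs = [[a - b for a, b in zip(x, x0)] for x in points[1:]]
    return rank(diffs) == len(diffs)


def in_general_position(points, n_dim):
    # Subsets of an independent set are independent, so it suffices to test
    # the subsets of the largest relevant size, min(len(points), N + 1).
    size = min(len(points), n_dim + 1)
    return all(geometrically_independent(list(s))
               for s in combinations(points, size))


def perturb(xs, n_dim, delta, rng):
    ys = []
    for x in xs:
        y = tuple(Fraction(c) for c in x)   # keep x itself if it already works
        while not in_general_position(ys + [y], n_dim):
            # Each coordinate moves by less than delta, so |x - y| < delta
            # in the square metric.
            y = tuple(Fraction(c) + delta * Fraction(rng.randint(-999, 999), 1000)
                      for c in x)
        ys.append(y)
    return ys


xs = [(0, 0), (1, 0), (2, 0), (1, 0)]       # collinear, with a repeated point
delta = Fraction(1, 10)
ys = perturb(xs, 2, delta, random.Random(0))
print(in_general_position(ys, 2))  # prints True
```

As in the proof, the first point is left where it is, and each later point is moved only if it would otherwise lie on a plane determined by the points already chosen.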


Theorem 50.5 (The imbedding theorem). Every compact metrizable space X of
topological dimension m can be imbedded in $R^{2m+1}$.


Proof. Let N = 2m + 1. Let us denote the square metric for $R^N$ by

$$|x - y| = \max\{|x_i - y_i| ;\ i = 1, \dots, N\}.$$

Then we can use $\rho$ to denote the corresponding sup metric on the space $C(X, R^N)$:

$$\rho(f, g) = \sup\{|f(x) - g(x)| ;\ x \in X\}.$$

The space $C(X, R^N)$ is complete in the metric $\rho$, since $R^N$ is complete in the square
metric.

Choose a metric d for the space X; because X is compact, d is bounded. Given a
continuous map $f : X \to R^N$, let us define

$$\Delta(f) = \sup\{\operatorname{diam} f^{-1}(\{z\}) \mid z \in f(X)\}.$$

The number $\Delta(f)$ measures how far f “deviates” from being injective; if $\Delta(f) = 0$,
each set $f^{-1}(\{z\})$ consists of exactly one point, so f is injective.

Now, given $\epsilon > 0$, define $U_\epsilon$ to be the set of all those continuous maps $f : X \to R^N$
for which $\Delta(f) < \epsilon$; it consists of all those maps that “deviate” from being
injective by less than $\epsilon$. We shall show that $U_\epsilon$ is both open and dense in $C(X, R^N)$.
It follows that the intersection

$$\bigcap_{n \in Z_+} U_{1/n}$$

is dense in $C(X, R^N)$ and is in particular nonempty.

If f is an element of this intersection, then $\Delta(f) < 1/n$ for every n. Therefore,
$\Delta(f) = 0$ and f is injective. Because X is compact, f is an imbedding. Thus, the
imbedding theorem is proved.

(1) $U_\epsilon$ is open in $C(X, R^N)$. Given an element f of $U_\epsilon$, we wish to find some
ball $B_\rho(f, \delta)$ about f that is contained in $U_\epsilon$. First choose a number b such that
$\Delta(f) < b < \epsilon$. Note that if f(x) = f(y) = z, then x and y belong to the set $f^{-1}(\{z\})$,
so that d(x, y) must be less than b. It follows that if we let A be the following subset
of X × X,

$$A = \{x \times y \mid d(x, y) \ge b\},$$

then the function $|f(x) - f(y)|$ is positive on A. Now A is closed in X × X and
therefore compact; hence the function $|f(x) - f(y)|$ has a positive minimum on A.
Let

$$\delta = \tfrac{1}{2} \min\{|f(x) - f(y)| ;\ x \times y \in A\}.$$

We assert that this value of $\delta$ will suffice.
Suppose that g is a map such that $\rho(f, g) < \delta$. If $x \times y \in A$, then $|f(x) - f(y)| \ge 2\delta$
by definition; since g(x) and g(y) are within $\delta$ of f(x) and f(y), respectively, we
must have $|g(x) - g(y)| > 0$. Hence the function $|g(x) - g(y)|$ is positive on A. As a
result, if x and y are two points such that g(x) = g(y), then necessarily d(x, y) < b.
We conclude that $\Delta(g) \le b < \epsilon$, as desired.
(2) $U_\epsilon$ is dense in $C(X, R^N)$. This is the difficult part of the proof. We need to
use the analytic geometry of $R^N$ discussed earlier. Let $f \in C(X, R^N)$. Given $\epsilon > 0$
and given $\delta > 0$, we wish to find a function $g \in C(X, R^N)$ such that $g \in U_\epsilon$ and
$\rho(f, g) < \delta$.
Let us cover X by finitely many open sets $\{U_1, \dots, U_n\}$ such that
(1) $\operatorname{diam} U_i < \epsilon/2$ in X,
(2) $\operatorname{diam} f(U_i) < \delta/2$ in $R^N$,
(3) $\{U_1, \dots, U_n\}$ has order $\le m + 1$.
Let $\{\phi_i\}$ be a partition of unity dominated by $\{U_i\}$ (see §36). For each i, choose a point
$x_i \in U_i$. Then choose, for each i, a point $z_i \in R^N$ such that $z_i$ is within $\delta/2$ of the
point $f(x_i)$, and such that the set $\{z_1, \dots, z_n\}$ is in general position in $R^N$. Finally,
define $g : X \to R^N$ by the equation

$$g(x) = \sum_{i=1}^{n} \phi_i(x) z_i.$$

We assert that g is the desired function.
First, we show that $\rho(f, g) < \delta$. Note that

$$g(x) - f(x) = \sum_{i=1}^{n} \phi_i(x) z_i - \sum_{i=1}^{n} \phi_i(x) f(x);$$

here we use the fact that $\sum \phi_i(x) = 1$. Then

$$g(x) - f(x) = \sum \phi_i(x)(z_i - f(x_i)) + \sum \phi_i(x)(f(x_i) - f(x)).$$

Now $|z_i - f(x_i)| < \delta/2$ for each i, by choice of the points $z_i$. And if i is an index
such that $\phi_i(x) \ne 0$, then $x \in U_i$; because we have $\operatorname{diam} f(U_i) < \delta/2$, it follows that
$|f(x_i) - f(x)| < \delta/2$. Since $\sum \phi_i(x) = 1$, we conclude that $|g(x) - f(x)| < \delta$.
Therefore, $\rho(g, f) < \delta$, as desired.

Second, we show that $g \in U_\epsilon$. We shall prove that if $x, y \in X$ and g(x) = g(y),
then x and y belong to one of the open sets $U_i$, so that necessarily $d(x, y) < \epsilon/2$
(since $\operatorname{diam} U_i < \epsilon/2$). As a result, $\Delta(g) \le \epsilon/2 < \epsilon$, as desired.

So suppose g(x) = g(y). Then

$$\sum_{i=1}^{n} [\phi_i(x) - \phi_i(y)] z_i = 0.$$

Because the covering $\{U_i\}$ has order at most m + 1, at most m + 1 of the numbers $\phi_i(x)$
are nonzero, and at most m + 1 of the numbers $\phi_i(y)$ are nonzero. Thus, the sum
$\sum [\phi_i(x) - \phi_i(y)] z_i$ has at most 2m + 2 nonzero terms. Note that the sum of the
coefficients vanishes because

$$\sum [\phi_i(x) - \phi_i(y)] = 1 - 1 = 0.$$

The points $z_i$ are in general position in $R^N$, so that any subset of them having N + 1
or fewer elements is geometrically independent. And by hypothesis N + 1 = 2m + 2.
(Aha!) Therefore, we conclude that

$$\phi_i(x) - \phi_i(y) = 0$$

for all i.
Now $\phi_i(x) > 0$ for some i, so that $x \in U_i$. Since $\phi_i(y) = \phi_i(x)$, we have $y \in U_i$
also, as asserted. ∎


To give some content to the imbedding theorem, we need some more examples of 
spaces that are finite dimensional. We prove the following theorem. 


Theorem 50.6. Every compact subspace of $R^N$ has topological dimension at most N.


Proof. The proof is a generalization of the proof given in Example 3 for $R^2$. Let $\rho$
be the square metric on $R^N$.

Step 1. We begin by breaking $R^N$ up into “unit cubes.” Define $\mathcal{J}$ to be the following
collection of open intervals in R:

$$\mathcal{J} = \{(n, n+1) \mid n \in Z\},$$

and define $\mathcal{K}$ to be the following collection of one-point sets in R:

$$\mathcal{K} = \{\{n\} \mid n \in Z\}.$$




If M is an integer such that $0 \le M \le N$, let $\mathcal{C}_M$ denote the set of all products

$$C = A_1 \times A_2 \times \cdots \times A_N,$$

where exactly M of the sets $A_i$ belong to $\mathcal{J}$, and the remainder belong to $\mathcal{K}$. If M > 0,
then C is homeomorphic to the product $(0, 1)^M$ and will be called an M-cube. If
M = 0, then C consists of a single point and will be called a 0-cube.

Let $\mathcal{C} = \mathcal{C}_0 \cup \mathcal{C}_1 \cup \cdots \cup \mathcal{C}_N$. Note that each point x of $R^N$ lies in precisely one
element of $\mathcal{C}$ because each real number $x_i$ lies in precisely one element of $\mathcal{J} \cup \mathcal{K}$.
We shall expand each element C of $\mathcal{C}$ slightly to an open set U(C) of $R^N$ of diameter
at most 3/2, in such a way that if C and D are two different M-cubes, then U(C)
and U(D) are disjoint.

Let $x = (x_1, \dots, x_N)$ be a point of the M-cube C. We show that there is a number
$\epsilon(x) > 0$ such that the $\epsilon(x)$-neighborhood of x intersects no M-cube other than C. If
C is a 0-cube, we set $\epsilon(x) = 1/2$ and we are finished. Otherwise, M > 0, and exactly
M of the numbers $x_i$ are not integers. Choose $\epsilon < 1/2$ so that for each $x_i$ that is not an
integer, the interval $(x_i - \epsilon, x_i + \epsilon)$ contains no integer. If $y = (y_1, \dots, y_N)$ is a point
lying in the $\epsilon$-neighborhood of x, then $y_i$ is nonintegral whenever $x_i$ is nonintegral.
This means that y either belongs to the same M-cube as x does, or y belongs to some
L-cube for L > M. In either case, the $\epsilon$-neighborhood of x intersects no M-cube other
than C.

Given an M-cube C, we define the neighborhood U(C) of C to be the union of
the $\epsilon(x)/2$-neighborhoods of x for all $x \in C$. It is then immediate that if C and D are
different M-cubes, U(C) and U(D) are disjoint. Furthermore, if z is a point of U(C),
then $\rho(z, x) < \epsilon(x)/2 \le 1/4$ for some point x of C. Since C has diameter 1, the
set U(C) has diameter at most 3/2.
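The decomposition of $R^N$ into cubes is easy to compute coordinate by coordinate. The following sketch locates the unique cube containing a given point and produces one admissible value of $\epsilon(x)$; the function names and the encoding of a cube as a tuple of factors are assumptions made for this illustration.

```python
import math


def cube_of(x):
    """Return (M, cube) for the unique cube of the decomposition containing x.

    Each coordinate lies either in an open interval (n, n + 1), recorded as the
    pair (n, n + 1), or in a one-point set {n}, recorded as the integer n.
    M counts the open-interval factors, so the cube is an M-cube.
    """
    factors = []
    for xi in x:
        n = math.floor(xi)
        factors.append(n if xi == n else (n, n + 1))
    m = sum(1 for a in factors if isinstance(a, tuple))
    return m, tuple(factors)


def epsilon(x):
    """A radius eps(x) <= 1/2 such that no interval (x_i - eps, x_i + eps)
    around a nonintegral coordinate x_i contains an integer."""
    gaps = [min(xi - math.floor(xi), math.ceil(xi) - xi)
            for xi in x if xi != math.floor(xi)]
    return min([0.5] + gaps)


print(cube_of((0.5, 2, -1.25)))   # (2, ((0, 1), 2, (-2, -1)))
print(epsilon((0.5, 2, -1.25)))   # 0.25
```

Here the point (0.5, 2, -1.25) of $R^3$ has exactly two nonintegral coordinates, so it lies in a 2-cube; the radius 0.25 is the distance from -1.25 to the nearest integer.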

Step 2. Given M with $0 \le M \le N$, define $\mathcal{A}_M$ to be the collection of all
sets U(C), where $C \in \mathcal{C}_M$. The elements of $\mathcal{A}_M$ are disjoint, and each has diameter
at most 3/2. The remainder of the proof is a copy of the proof given in Example 3
for $R^2$. ∎


Corollary 50.7. Every compact m-manifold has topological dimension at most m.

Corollary 50.8. Every compact m-manifold can be imbedded in $R^{2m+1}$.


Corollary 50.9. Let X be a compact metrizable space. Then X can be imbedded in
some euclidean space $R^N$ if and only if X has finite topological dimension.


As mentioned earlier, much of what we have proved holds without assumption of
compactness. We ask you to prove the appropriate generalizations in the exercises that
follow.

One thing we do not ask you to prove is the fact that the topological dimension
of an m-manifold is precisely m. And for good reason; the proof requires the tools of
algebraic topology.

Nor do we ask you to prove that N = 2m + 1 is the smallest value of N such that
every compact metrizable space of topological dimension m can be imbedded in $R^N$.
The reason is the same. Even in the case of a linear graph, where m = 1, the proof is
nontrivial, as we remarked earlier.

For further results in dimension theory, the reader is referred to the classical book 
of Hurewicz and Wallman [H-W]. In particular, this book discusses another, entirely 
different, definition of topological dimension, due to Menger and Urysohn. It is an 
inductive definition. The empty set has dimension -1. And a space has dimension
at most n if there is a basis for its topology such that for each basis element B, the
boundary of B has dimension at most n - 1. The dimension of a space is the smallest
value of n for which this condition holds. This notion of dimension agrees with ours
for compact metrizable spaces.


Exercises 


1. Show that any discrete space has dimension 0.


2. Show that any connected $T_1$ space having more than one point has dimension at
least 1.

3. Show that the topologist’s sine curve has dimension 1.

4. Show that the points 0, $e_1$, $e_2$, $e_3$, and (1, 1, 1) are in general position in $R^3$.
Sketch the corresponding imbedding into $R^3$ of the complete graph on five vertices.


5. Examine the proof of the imbedding theorem in the case m = 1 and show that
the map g of part (2) actually maps X onto a linear graph in $R^3$.


6. Prove the following:

Theorem. Let X be a locally compact Hausdorff space with a countable basis,
such that every compact subspace of X has topological dimension at most m.
Then X is homeomorphic to a closed subspace of $R^{2m+1}$.

Proof. If $f : X \to R^N$ is a continuous map, we say $f(x) \to \infty$ as $x \to \infty$ if
given n, there is a compact subspace C of X such that $|f(x)| > n$ for $x \in X - C$.

(a) Let $\bar\rho$ be the uniform metric on $C(X, R^N)$. Show that if $f(x) \to \infty$ as
$x \to \infty$ and $\bar\rho(f, g) < 1$, then $g(x) \to \infty$ as $x \to \infty$.

(b) Show that if $f(x) \to \infty$ as $x \to \infty$, then f extends to a continuous map of
one-point compactifications. Conclude that if f is injective as well, then f
is a homeomorphism of X with a closed subspace of $R^N$.

(c) Given $f : X \to R^N$ and given a compact subspace C of X, let

$$U_\epsilon(C) = \{f \mid \Delta(f|C) < \epsilon\}.$$

Show that $U_\epsilon(C)$ is open in $C(X, R^N)$.
(d) Show that if N = 2m + 1, then $U_\epsilon(C)$ is dense in $C(X, R^N)$. [Hint: Given f
and given $\epsilon, \delta > 0$, choose $g : C \to R^N$ so that $d(f(x), g(x)) < \delta$ for
$x \in C$, and $\Delta(g) < \epsilon$. Extend $f - g$ to $h : X \to [-\delta, \delta]^N$ using the Tietze
theorem.]

(e) Show there exists a map $f : X \to R$ such that $f(x) \to \infty$ as $x \to \infty$.
[Hint: Write X as the union of compact subspaces $C_n$ such that
$C_n \subset \operatorname{Int} C_{n+1}$ for each n.]

(f) Let $C_n$ be as in (e). Use the fact that $\bigcap U_{1/n}(C_n)$ is dense in $C(X, R^N)$ to
complete the proof.


7. Corollary. Every m-manifold can be imbedded in $R^{2m+1}$ as a closed subspace.

8. Recall that X is said to be σ-compact if there is a countable collection of compact
subspaces of X whose interiors cover X.

Theorem. Let X be a σ-compact Hausdorff space. If every compact subspace
of X has topological dimension at most m, then so does X.

Proof. Let $\mathcal{A}$ be an open cover of X. Find an open cover $\mathcal{B}$ of X refining $\mathcal{A}$ that
has order at most m + 1, as follows:

(a) Show that $X = \bigcup X_n$, where $X_n$ is compact and $X_n \subset \operatorname{Int} X_{n+1}$ for each n.
Let $X_0 = \varnothing$.

(b) Find an open covering $\mathcal{B}_0$ of X refining $\mathcal{A}$ such that for each n, each element
of $\mathcal{B}_0$ that intersects $X_n$ lies in $X_{n+1}$.

(c) Suppose $n \ge 0$ and $\mathcal{B}_n$ is an open covering of X refining $\mathcal{B}_0$ such that
$\mathcal{B}_n$ has order at most m + 1 at points of $X_n$. Choose an open covering $\mathcal{C}$
of X refining $\mathcal{B}_n$ that has order at most m + 1 at points of $X_{n+1}$. Choose
$f : \mathcal{C} \to \mathcal{B}_n$ so that $C \subset f(C)$. For $B \in \mathcal{B}_n$, let D(B) be the union of
those C for which f(C) = B. Let $\mathcal{B}_{n+1}$ consist of all sets $B \in \mathcal{B}_n$ for
which $B \cap X_{n-1} \ne \varnothing$, along with all sets D(B) for which $B \in \mathcal{B}_n$ and
$B \cap X_{n-1} = \varnothing$. Show that $\mathcal{B}_{n+1}$ is an open covering of X that refines $\mathcal{B}_n$
and has order at most m + 1 at points of $X_{n+1}$.

(d) Define $\mathcal{B}$ as follows: Given a set B, it belongs to $\mathcal{B}$ if there is an N such
that $B \in \mathcal{B}_n$ for all n > N.


9. Corollary. Every m-manifold has topological dimension at most m.

10. Corollary. Every closed subspace of $R^N$ has topological dimension at most N.

11. Corollary. A space X can be imbedded as a closed subspace of $R^N$ for some N
if and only if X is locally compact and Hausdorff with a countable basis, and has
finite topological dimension.


*Supplementary Exercises: Locally Euclidean Spaces 


A space X is said to be locally m-euclidean if for each x ∈ X, there is a neighborhood
of x that is homeomorphic to an open set of $R^m$. Such a space X automatically satisfies
the $T_1$ axiom, but it need not be Hausdorff. (See the exercises of §36.) However, if X
is Hausdorff and has a countable basis, then X is called an m-manifold.

Throughout these exercises, let X be a space that is locally m-euclidean.
1. Show that X is locally compact and locally metrizable.
2. Consider the following conditions on X:
(i) X is compact Hausdorff.
(ii) X is an m-manifold.
(iii) X is metrizable.
(iv) X is normal.
(v) X is Hausdorff.
Show that (i) ⇒ (ii) ⇒ (iii) ⇒ (iv) ⇒ (v).
3. Show that R is locally 1-euclidean and satisfies (ii) but not (i).
4. Show that R × R in the dictionary order topology is locally 1-euclidean and
satisfies (iii) but not (ii).
5. Show that the long line is locally 1-euclidean and satisfies (iv) but not (iii). (See
the exercises of §24.)


*6. There is a space that is locally 2-euclidean and satisfies (v) but not (iv). It is
constructed as follows. Let A be the following subspace of $R^3$:

$$A = \{(x, y, 0) \mid x > 0\}.$$

Given c real, let $B_c$ be the following subspace of $R^3$:

$$B_c = \{(x, y, c) \mid x \le 0\}.$$

Let X be the set that is the union of A and all the spaces $B_c$, for c real. Topologize
X by taking as a basis all sets of the following three types:
(i) U, where U is open in A.
(ii) V, where V is open in the subspace of $B_c$ consisting of points with x < 0.
(iii) For each open interval I = (a, b) of R, each real number c, and each $\epsilon > 0$,
the set $A_c(I, \epsilon) \cup B_c(I, \epsilon)$, where

$$A_c(I, \epsilon) = \{(x, y, 0) \mid 0 < x < \epsilon \text{ and } c + ax < y < c + bx\},$$
$$B_c(I, \epsilon) = \{(x, y, c) \mid -\epsilon < x \le 0 \text{ and } a < y < b\}.$$

The space X is called the “Prüfer manifold.”

(a) Sketch the sets $A_c(I, \epsilon)$ and $B_c(I, \epsilon)$.

(b) Show the sets of types (i)-(iii) form a basis for a topology on X.
(c) Show the map $f_c : R^2 \to X$ given by

$$f_c(x, y) = \begin{cases} (x, c + xy, 0) & \text{for } x > 0, \\ (x, y, c) & \text{for } x \le 0 \end{cases}$$

defines a homeomorphism of $R^2$ with the subspace $A \cup B_c$ of X.

(d) Show that $A \cup B_c$ is open in X; conclude that X is locally 2-euclidean.
(e) Show that X is Hausdorff.
(f) Show that X is not normal. [Hint: The subspace

$$L = \{(0, 0, c) \mid c \in R\}$$

of X is closed and discrete. Compare Example 3 of §31.]
7. Show that X is Hausdorff if and only if X is completely regular.
8. Show that X is metrizable if and only if X is paracompact Hausdorff.


9. Show that if X is metrizable, then each component of X is an m-manifold.




Part II 
ALGEBRAIC TOPOLOGY 


Chapter 9 


The Fundamental Group 


One of the basic problems of topology is to determine whether two given topological 
spaces are homeomorphic or not. There is no method for solving this problem in 
general, but techniques do exist that apply in particular cases. 

Showing that two spaces are homeomorphic is a matter of constructing a contin- 
uous mapping from one to the other having a continuous inverse, and constructing 
continuous functions is a problem that we have developed techniques to handle. 

Showing that two spaces are not homeomorphic is a different matter. For that,
one must show that a continuous function with continuous inverse does not exist. If
one can find some topological property that holds for one space but not for the other,
then the problem is solved: the spaces cannot be homeomorphic. The closed interval
[0, 1] cannot be homeomorphic to the open interval (0, 1), for instance, because the
first space is compact and the second one is not. And the real line R cannot be
homeomorphic to the “long line” L, because R has a countable basis and L does not. Nor
can the real line R be homeomorphic to the plane $R^2$; deleting a point from $R^2$ leaves
a connected space remaining, and deleting a point from R does not.

But the topological properties we have studied up to now do not carry us very far
in solving the problem. For instance, how does one show that the plane $R^2$ is not
homeomorphic to three-dimensional space $R^3$? As one goes down the list of topological
properties (compactness, connectedness, local connectedness, metrizability, and
so on), one can find no topological property that distinguishes between them. As another
example, consider such surfaces as the 2-sphere $S^2$, the torus T (surface of a
doughnut), and the double torus T#T (surface of a two-holed doughnut). None of the
topological properties we have studied up to now will distinguish between them.

So we must introduce new properties and new techniques. One of the most natural
such properties is that of simple connectedness. You probably have studied this notion
already, when you studied line integrals in the plane. Roughly speaking, one says that
a space X is simply connected if every closed curve in X can be shrunk to a point
in X. (We shall make this more precise later.) The property of simple connectedness,
it turns out, will distinguish between $R^2$ and $R^3$; deleting a point from $R^3$ leaves a
simply connected space remaining, but deleting a point from $R^2$ does not. It will also
distinguish between $S^2$ (which is simply connected) and the torus T (which is not).
But it will not distinguish between T and T#T; neither of them is simply connected.

There is an idea more general than the idea of simple connectedness, an idea that 
includes simple connectedness as a special case. It involves a certain group that is 
called the fundamental group of the space. Two spaces that are homeomorphic have 
fundamental groups that are isomorphic. And the condition of simple connectedness 
is just the condition that the fundamental group of X is the trivial (one-element) group. 
Thus, the proof that $S^2$ and T are not homeomorphic can be rephrased by saying that
the fundamental group of $S^2$ is trivial and the fundamental group of T is not. The
fundamental group will distinguish between more spaces than the condition of simple 
connectedness will. It can be used, for example, to show that T and T#T are not 
homeomorphic; it turns out that T has an abelian fundamental group and T#T does 
not. 

In this chapter, we define the fundamental group and study its properties. Then 
we apply it to a number of problems, including the problem of showing that various 
spaces, such as those already mentioned, are not homeomorphic. 

Other applications include theorems about fixed points and antipode-preserving 
maps of the sphere, as well as the well-known fundamental theorem of algebra, which 
says that every polynomial equation with real or complex coefficients has a root. Fi- 
nally, there is the famous Jordan curve theorem, which we shall study in the next 
chapter; it states that every simple closed curve C in the plane separates the plane into 
two components, of which C is the common boundary. 

Throughout, we assume familiarity with the quotient topology (§22) and local 
connectedness (§25). 


§51 Homotopy of Paths 


Before defining the fundamental group of a space X, we shall consider paths on X and
an equivalence relation called path homotopy between them. And we shall define a
certain operation on the collection of the equivalence classes that makes it into what is
called in algebra a groupoid.

Definition. If f and f' are continuous maps of the space X into the space Y, we say
that f is homotopic to f' if there is a continuous map F : X × I → Y such that

$$F(x, 0) = f(x) \quad\text{and}\quad F(x, 1) = f'(x)$$

for each x. (Here I = [0, 1].) The map F is called a homotopy between f and f'. If
f is homotopic to f', we write $f \simeq f'$. If $f \simeq f'$ and f' is a constant map, we say
that f is nulhomotopic.


We think of a homotopy as a continuous one-parameter family of maps from X
to Y. If we imagine the parameter t as representing time, then the homotopy F represents
a continuous “deforming” of the map f to the map f', as t goes from 0 to 1.

Now we consider the special case in which f is a path in X. Recall that if f :
[0, 1] → X is a continuous map such that $f(0) = x_0$ and $f(1) = x_1$, we say that f is
a path in X from $x_0$ to $x_1$. We also say that $x_0$ is the initial point, and $x_1$ the final point,
of the path f. In this chapter, we shall for convenience use the interval I = [0, 1] as
the domain for all paths.

If f and f’ are two paths in X, there is a stronger relation between them than mere 
homotopy. It is defined as follows: 


Definition. Two paths f and f', mapping the interval I = [0, 1] into X, are said to
be path homotopic if they have the same initial point $x_0$ and the same final point $x_1$,
and if there is a continuous map F : I × I → X such that

$$F(s, 0) = f(s) \quad\text{and}\quad F(s, 1) = f'(s),$$
$$F(0, t) = x_0 \quad\text{and}\quad F(1, t) = x_1,$$

for each s ∈ I and each t ∈ I. We call F a path homotopy between f and f'. See
Figure 51.1. If f is path homotopic to f', we write $f \simeq_p f'$.


Figure 51.1

The first condition says simply that F is a homotopy between f and f', and the
second says that for each t, the path $f_t$ defined by the equation $f_t(s) = F(s, t)$ is a path
from $x_0$ to $x_1$. Said differently, the first condition says that F represents a continuous
way of deforming the path f to the path f', and the second condition says that the end
points of the path remain fixed during the deformation.


Lemma 51.1. The relations $\simeq$ and $\simeq_p$ are equivalence relations.


If f is a path, we shall denote its path-homotopy equivalence class by [f].
Proof. Let us verify the properties of an equivalence relation.

Given f, it is trivial that $f \simeq f$; the map F(x, t) = f(x) is the required homotopy.
If f is a path, F is a path homotopy.

Given $f \simeq f'$, we show that $f' \simeq f$. Let F be a homotopy between f and f'.
Then G(x, t) = F(x, 1 - t) is a homotopy between f' and f. If F is a path homotopy,
so is G.

Suppose that $f \simeq f'$ and $f' \simeq f''$. We show that $f \simeq f''$. Let F be a homotopy
between f and f', and let F' be a homotopy between f' and f''. Define G : X × I →
Y by the equation

$$G(x, t) = \begin{cases} F(x, 2t) & \text{for } t \in [0, \tfrac{1}{2}], \\ F'(x, 2t - 1) & \text{for } t \in [\tfrac{1}{2}, 1]. \end{cases}$$

The map G is well defined, since if t = 1/2, we have F(x, 2t) = f'(x) = F'(x, 2t - 1).
Because G is continuous on the two closed subsets X × [0, 1/2] and X × [1/2, 1] of X × I, it
is continuous on all of X × I, by the pasting lemma. Thus G is the required homotopy
between f and f''.

You can check that if F and F' are path homotopies, so is G. See Figure 51.2. ∎
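The two constructions used in this proof, running a homotopy backwards and running two homotopies one after the other at double speed, translate directly into code. The sketch below models a homotopy as an ordinary function of (x, t); the function names and the sample maps on the real line are assumptions chosen for illustration, and continuity is of course not something the code can verify.

```python
def reverse_homotopy(F):
    """Homotopy from f' to f obtained by running F backwards in time."""
    return lambda x, t: F(x, 1 - t)


def concatenate(F, F2):
    """Homotopy G from f to f'' built from F (f to f') and F2 (f' to f'')."""
    def G(x, t):
        # First half of the time interval follows F, second half follows F2.
        return F(x, 2 * t) if t <= 0.5 else F2(x, 2 * t - 1)
    return G


# Sample maps R -> R: f(x) = x, f'(x) = x^2, f''(x) = 0.
F = lambda x, t: (1 - t) * x + t * x * x      # from f to f'
F2 = lambda x, t: (1 - t) * x * x             # from f' to f''
G = concatenate(F, F2)

print(G(3, 0), G(3, 0.5), G(3, 1))            # 3 9.0 0
```

At t = 1/2 both formulas give f'(x), which is exactly the compatibility needed for the pasting lemma.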


Figure 51.2 


EXAMPLE 1. Let f and g be any two maps of a space X into $R^2$. It is easy to see that f
and g are homotopic; the map

$$F(x, t) = (1 - t) f(x) + t g(x)$$

is a homotopy between them. It is called a straight-line homotopy because it moves the
point f(x) to the point g(x) along the straight-line segment joining them.

If f and g are paths from $x_0$ to $x_1$, then F will be a path homotopy, as you can check.
This situation is pictured in Figure 51.3.

More generally, let A be any convex subspace of $R^n$. (This means that for any two
points a, b of A, the straight line segment joining a and b is contained in A.) Then any two
paths f, g in A from $x_0$ to $x_1$ are path homotopic in A, for the straight-line homotopy F
between them has image set in A.
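The straight-line homotopy can be written out as a short function. In the sketch below, points of $R^n$ are represented as tuples, and the two sample paths from (0, 0) to (1, 1) are hypothetical choices made only to exhibit the four conditions for a path homotopy.

```python
def straight_line_homotopy(f, g):
    """F(x, t) = (1 - t) f(x) + t g(x) for maps f, g into R^n (as tuples)."""
    def F(x, t):
        return tuple((1 - t) * a + t * b for a, b in zip(f(x), g(x)))
    return F


# Two sample paths in the plane from (0, 0) to (1, 1).
f = lambda s: (s, s)
g = lambda s: (s, s * s)
F = straight_line_homotopy(f, g)

print(F(0.5, 0), F(0.5, 1))     # the points f(0.5) and g(0.5)
print(F(0, 0.25), F(1, 0.25))   # the end points stay fixed for every t
```

Because f and g agree at s = 0 and at s = 1, the segment joining f(s) to g(s) degenerates to a point there, which is why the end points do not move.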


Figure 51.3 Figure 51.4 


EXAMPLE 2. Let X denote the punctured plane, $R^2 - \{0\}$, which we shall denote by
$R^2 - 0$ for short. The following paths in X,

$$f(s) = (\cos \pi s, \sin \pi s),$$
$$g(s) = (\cos \pi s, 2 \sin \pi s)$$

are path homotopic; the straight-line homotopy between them is an acceptable path homotopy.
But the straight-line homotopy between f and the path

$$h(s) = (\cos \pi s, -\sin \pi s)$$

is not acceptable, for its image does not lie in the space $X = R^2 - 0$. See Figure 51.4.

Indeed, there exists no path homotopy in X between paths f and h. This result is
hardly surprising; it is intuitively clear that one cannot “deform f past the hole at 0” without
introducing a discontinuity. But it takes some work to prove. We shall return to this
example later.

This example illustrates the fact that you must know what the range space is before
you can tell whether two paths are path homotopic or not. The paths f and h would be
path homotopic if they were paths in $R^2$.
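A numerical experiment makes the difference between the two straight-line homotopies visible. The sketch below samples each homotopy on a grid and records how close its image comes to the origin; the grid size and the function names are arbitrary assumptions, and a grid search is only suggestive, not a proof.

```python
import math


def f(s):
    return (math.cos(math.pi * s), math.sin(math.pi * s))


def g(s):
    return (math.cos(math.pi * s), 2 * math.sin(math.pi * s))


def h(s):
    return (math.cos(math.pi * s), -math.sin(math.pi * s))


def straight_line(p, q, t):
    return tuple((1 - t) * a + t * b for a, b in zip(p, q))


def min_distance_to_origin(path_a, path_b, steps=200):
    """Smallest |F(s, t)| over a grid of (s, t), F the straight-line homotopy."""
    best = float("inf")
    for i in range(steps + 1):
        for j in range(steps + 1):
            x, y = straight_line(path_a(i / steps), path_b(i / steps), j / steps)
            best = min(best, math.hypot(x, y))
    return best


print(min_distance_to_origin(f, g))   # stays about 1 away from the origin
print(min_distance_to_origin(f, h))   # essentially 0: the homotopy hits the hole
```

For f and g the homotopy is $(\cos \pi s, (1 + t)\sin \pi s)$, whose distance from the origin is at least 1. For f and h it is $(\cos \pi s, (1 - 2t)\sin \pi s)$, which passes through the origin at s = t = 1/2.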


Now we introduce some algebra into this geometric situation. We define a certain
operation on path-homotopy classes as follows:

Definition. If f is a path in X from $x_0$ to $x_1$, and if g is a path in X from $x_1$ to $x_2$,
we define the product f * g of f and g to be the path h given by the equations

$$h(s) = \begin{cases} f(2s) & \text{for } s \in [0, \tfrac{1}{2}], \\ g(2s - 1) & \text{for } s \in [\tfrac{1}{2}, 1]. \end{cases}$$

The function h is well-defined and continuous, by the pasting lemma; it is a path in
X from $x_0$ to $x_2$. We think of h as the path whose first half is the path f and whose
second half is the path g.
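The product of paths is a two-case formula and can be transcribed directly. In the sketch below a path is any function on [0, 1]; the function name, the explicit end-point check, and the two sample paths in the plane are assumptions made for illustration.

```python
def product(f, g):
    """The path f * g: f at double speed on [0, 1/2], then g on [1/2, 1]."""
    if f(1) != g(0):
        raise ValueError("f * g is defined only when f(1) = g(0)")

    def h(s):
        return f(2 * s) if s <= 0.5 else g(2 * s - 1)
    return h


# Sample paths: f runs from (0, 0) to (1, 0), and g from (1, 0) to (1, 1).
f = lambda s: (s, 0)
g = lambda s: (1, s)
h = product(f, g)

print(h(0), h(0.25), h(0.5), h(0.75), h(1))
# (0, 0) (0.5, 0) (1.0, 0) (1, 0.5) (1, 1)
```

The value at s = 1/2 may be computed from either case, since f(1) = g(0); this is the condition that makes h well defined.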


The product operation on paths induces a well-defined operation on path-homotopy
classes, defined by the equation

$$[f] * [g] = [f * g].$$

To verify this fact, let F be a path homotopy between f and f' and let G be a path
homotopy between g and g'. Define

$$H(s, t) = \begin{cases} F(2s, t) & \text{for } s \in [0, \tfrac{1}{2}], \\ G(2s - 1, t) & \text{for } s \in [\tfrac{1}{2}, 1]. \end{cases}$$

Because $F(1, t) = x_1 = G(0, t)$ for all t, the map H is well-defined; it is continuous
by the pasting lemma. You can check that H is the required path homotopy between
f * g and f' * g'. It is pictured in Figure 51.5.


The operation * on path-homotopy classes turns out to satisfy properties that look
very much like the axioms for a group. They are called the groupoid properties of *.
One difference from the properties of a group is that [f] * [g] is not defined for every
pair of classes, but only for those pairs [f], [g] for which f(1) = g(0).


Theorem 51.2. The operation * has the following properties:
(1) (Associativity) If [f] * ([g] * [h]) is defined, so is ([f] * [g]) * [h], and they are
equal.
(2) (Right and left identities) Given x ∈ X, let $e_x$ denote the constant path $e_x : I \to X$
carrying all of I to the point x. If f is a path in X from $x_0$ to $x_1$, then

$$[f] * [e_{x_1}] = [f] \quad\text{and}\quad [e_{x_0}] * [f] = [f].$$

(3) (Inverse) Given the path f in X from $x_0$ to $x_1$, let $\bar f$ be the path defined by
$\bar f(s) = f(1 - s)$. It is called the reverse of f. Then

$$[f] * [\bar f] = [e_{x_0}] \quad\text{and}\quad [\bar f] * [f] = [e_{x_1}].$$


Proof. We shall make use of two elementary facts. The first is the fact that if k :
X → Y is a continuous map, and if F is a path homotopy in X between the paths f
and f', then k ∘ F is a path homotopy in Y between the paths k ∘ f and k ∘ f'. See
Figure 51.6.


Figure 51.6


The second is the fact that if k : X → Y is a continuous map and if f and g are
paths in X with f(1) = g(0), then

$$k \circ (f * g) = (k \circ f) * (k \circ g).$$


This equation follows at once from the definition of the product operation *. 


Step 1. We verify properties (2) and (3). To verify (2), we let $e_0$ denote the constant
path in I at 0, and we let $i : I \to I$ denote the identity map, which is a path in I from 0
to 1. Then $e_0 * i$ is also a path in I from 0 to 1. (The graphs of these two paths are
pictured in Figure 51.7.)


Figure 51.7


Because I is convex, there is a path homotopy G in I between i and $e_0 * i$. Then
$f \circ G$ is a path homotopy in X between the paths $f \circ i = f$ and

$$f \circ (e_0 * i) = (f \circ e_0) * (f \circ i) = e_{x_0} * f.$$

An entirely similar argument, using the fact that if $e_1$ denotes the constant path at 1,
then $i * e_1$ is path homotopic in I to the path i, shows that $[f] * [e_{x_1}] = [f]$.

To verify (3), note that the reverse of i is $\bar i(s) = 1 - s$. Then $i * \bar i$ is a path in I
beginning and ending at 0, and so is the constant path $e_0$. (Their graphs are pictured
in Figure 51.8.) Because I is convex, there is a path homotopy H in I between $e_0$ and
$i * \bar i$. Then $f \circ H$ is a path homotopy between $f \circ e_0 = e_{x_0}$ and

$$(f \circ i) * (f \circ \bar i) = f * \bar f.$$

An entirely similar argument, using the fact that $\bar i * i$ is path homotopic in I to $e_1$,
shows that $[\bar f] * [f] = [e_{x_1}]$.


Figure 51.8


Step 2. The proof of (1), associativity, is a bit trickier. For this proof, and for later
use as well, it will be convenient to describe the product f * g in a different way.

If [a, b] and [c, d] are two intervals in R, there is a unique map $p : [a, b] \to [c, d]$
of the form p(x) = mx + k that carries a to c and b to d; we call it the positive linear
map of [a, b] to [c, d] because its graph is a straight line with positive slope. Note that
the inverse of such a map is another such map, and so is the composite of two such
maps.

With this terminology, the product f * g can be described as follows: On $[0, \tfrac12]$, it
equals the positive linear map of $[0, \tfrac12]$ to [0, 1], followed by f; and on $[\tfrac12, 1]$, it equals
the positive linear map of $[\tfrac12, 1]$ to [0, 1], followed by g.

Now we verify (1). Given paths f, g, and h in X, the products f * (g * h) and
(f * g) * h are defined precisely when f(1) = g(0) and g(1) = h(0). Assuming these
two conditions, we define also a "triple product" of the paths f, g, and h as follows:
Choose points a and b of I so that 0 < a < b < 1. Define a path $k_{a,b}$ in X as follows:
On [0, a] it equals the positive linear map of [0, a] to I followed by f; on [a, b] it
equals the positive linear map of [a, b] to I followed by g; and on [b, 1] it equals the
positive linear map of [b, 1] to I followed by h. The path $k_{a,b}$ depends of course on the
choice of the points a and b. But its path-homotopy class does not! We show that if c
and d are another pair of points of I with 0 < c < d < 1, then $k_{c,d}$ is path homotopic
to $k_{a,b}$.

Let $p : I \to I$ be the map whose graph is pictured in Figure 51.9. When restricted
to [0, a], [a, b], and [b, 1], respectively, it equals the positive linear maps of these
intervals onto [0, c], [c, d], and [d, 1], respectively. It follows at once that $k_{c,d} \circ p$
equals $k_{a,b}$. But p is a path in I from 0 to 1; and so is the identity map $i : I \to I$.
Hence, there is a path homotopy P in I between p and i. Then $k_{c,d} \circ P$ is a path
homotopy in X between $k_{a,b}$ and $k_{c,d}$.


Figure 51.9


What has this to do with associativity? A great deal. For the product f * (g * h)
is exactly the triple product $k_{a,b}$ in the case where a = 1/2 and b = 3/4, as you can
check, while the product (f * g) * h equals $k_{c,d}$ in the case where c = 1/4 and d = 1/2.
Therefore these two products are path homotopic. ∎
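
The claim "as you can check" can be illustrated numerically. The following is a rough sketch, assuming paths are modeled as Python functions on [0, 1]; the helper names `positive_linear_map`, `product`, and `triple_product`, and the three sample paths, are hypothetical choices made only for this illustration. It compares the two iterated products with the corresponding triple products at dyadic sample points, where the floating-point arithmetic is exact.

```python
def positive_linear_map(a, b, c, d):
    """Return the map x -> m*x + k carrying a to c and b to d (a < b, c < d)."""
    m = (d - c) / (b - a)
    return lambda x: c + m * (x - a)


def product(f, g):
    """Return f * g as in the definition of the product of paths."""
    return lambda s: f(2 * s) if s <= 0.5 else g(2 * s - 1)


def triple_product(f, g, h, a, b):
    """Return k_{a,b}: f on [0, a], g on [a, b], h on [b, 1], each reparametrized."""
    first = positive_linear_map(0.0, a, 0.0, 1.0)
    second = positive_linear_map(a, b, 0.0, 1.0)
    third = positive_linear_map(b, 1.0, 0.0, 1.0)

    def k(s):
        if s <= a:
            return f(first(s))
        if s <= b:
            return g(second(s))
        return h(third(s))
    return k


# Hypothetical sample paths tracing three sides of the unit square.
def f(s):
    return (s, 0.0)


def g(s):
    return (1.0, s)


def h(s):
    return (1.0 - s, 1.0)


samples = [j / 16 for j in range(17)]
left = product(f, product(g, h))    # f * (g * h)
right = product(product(f, g), h)   # (f * g) * h
k_left = triple_product(f, g, h, 0.5, 0.75)
k_right = triple_product(f, g, h, 0.25, 0.5)
```

On these samples `left` agrees with `k_left` and `right` agrees with `k_right`, while `left` and `right` differ as functions (for instance at s = 1/4). They are equal only up to path homotopy, which is exactly what the proof establishes.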


The argument just used to prove associativity goes through for any finite product of
paths. Roughly speaking, it says that as far as the path-homotopy class of the result is
concerned, it doesn't matter how you chop up the interval when you form the product
of paths! This result will be useful to us later, so we state it formally as a theorem here:


Theorem 51.3. Let f be a path in X, and let $a_0, \ldots, a_n$ be numbers such that
$0 = a_0 < a_1 < \cdots < a_n = 1$. Let $f_i : I \to X$ be the path that equals the positive
linear map of I onto $[a_{i-1}, a_i]$ followed by f. Then

$$[f] = [f_1] * \cdots * [f_n].$$
Exercises


1. Show that if $h, h' : X \to Y$ are homotopic and $k, k' : Y \to Z$ are homotopic,
then $k \circ h$ and $k' \circ h'$ are homotopic.


2. Given spaces X and Y, let [X, Y] denote the set of homotopy classes of maps
of X into Y.
(a) Let I = [0, 1]. Show that for any X, the set [X, I] has a single element.
(b) Show that if Y is path connected, the set [I, Y] has a single element.


3. A space X is said to be contractible if the identity map $i_X : X \to X$ is
nulhomotopic.

(a) Show that I and R are contractible.

(b) Show that a contractible space is path connected.

(c) Show that if Y is contractible, then for any X, the set [X, Y] has a single
element.

(d) Show that if X is contractible and Y is path connected, then [X, Y] has a
single element.
The set of path-homotopy classes of paths in a space X does not form a group under the
operation * because the product of two path-homotopy classes is not always defined.
But suppose we pick out a point $x_0$ of X to serve as a "base point" and restrict ourselves
to those paths that begin and end at $x_0$. The set of these path-homotopy classes does
form a group under *. It will be called the fundamental group of X.

In this section, we shall study the fundamental group and derive some of its
properties. In particular, we shall show that the group is a topological invariant of the
space X, a fact that is of crucial importance in using it to study homeomorphism
problems.

Let us first review some terminology from group theory. Suppose G and G' are
groups, written multiplicatively. A homomorphism $f : G \to G'$ is a map such that
$f(x \cdot y) = f(x) \cdot f(y)$ for all x, y; it automatically satisfies the equations $f(e) = e'$ and
$f(x^{-1}) = f(x)^{-1}$, where e and e' are the identities of G and G', respectively, and the
exponent −1 denotes the inverse. The kernel of f is the set $f^{-1}(e')$; it is a subgroup
of G. The image of f, similarly, is a subgroup of G'. The homomorphism f is called a
monomorphism if it is injective (or equivalently, if the kernel of f consists of e alone).
It is called an epimorphism if it is surjective; and it is called an isomorphism if it is
bijective.

Suppose G is a group and H is a subgroup of G. Let xH denote the set of all
products xh, for $h \in H$; it is called a left coset of H in G. The collection of all such
cosets forms a partition of G. Similarly, the collection of all right cosets Hx of H in G
forms a partition of G. We call H a normal subgroup of G if $x \cdot h \cdot x^{-1} \in H$ for each
$x \in G$ and each $h \in H$. In this case, we have xH = Hx for each x, so that our two
partitions of G are the same. We denote this partition by G/H; if one defines

$$(xH) \cdot (yH) = (x \cdot y)H,$$

one obtains a well-defined operation on G/H that makes it a group. This group is
called the quotient of G by H. The map $f : G \to G/H$ carrying x to xH is an
epimorphism with kernel H. Conversely, if $f : G \to G'$ is an epimorphism, then its
kernel N is a normal subgroup of G, and f induces an isomorphism $G/N \to G'$ that
carries xN to f(x) for each $x \in G$.
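
A small computation may make the coset terminology concrete. The sketch below is an illustration only; it assumes the symmetric group on three letters, with a permutation stored as a tuple p where p[i] is the image of i, and the names `compose`, `inverse`, `left_cosets`, `right_cosets`, and `is_normal` are hypothetical helpers introduced here.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))


def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


G = list(permutations(range(3)))  # the six permutations of {0, 1, 2}


def left_cosets(H):
    """Return the set of left cosets xH, each as a frozenset."""
    return {frozenset(compose(x, h) for h in H) for x in G}


def right_cosets(H):
    """Return the set of right cosets Hx, each as a frozenset."""
    return {frozenset(compose(h, x) for h in H) for x in G}


def is_normal(H):
    """Check x·h·x^{-1} ∈ H for every x in G and h in H."""
    return all(compose(compose(x, h), inverse(x)) in H for x in G for h in H)


H = {(0, 1, 2), (1, 0, 2)}             # identity and one transposition
N = {(0, 1, 2), (1, 2, 0), (2, 0, 1)}  # the three cyclic rotations
```

Here both families of cosets of H partition G into three blocks, but the two partitions differ because H is not normal. For N the left and right cosets coincide, so G/N is a group with two elements.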

If the subgroup H of G is not normal, it will still be convenient to use the symbol 
G/H; we will use it to denote the collection of right cosets of H in G. 

Now we define the fundamental group. 


Definition. Let X be a space; let $x_0$ be a point of X. A path in X that begins and
ends at $x_0$ is called a loop based at $x_0$. The set of path homotopy classes of loops based
at $x_0$, with the operation *, is called the fundamental group of X relative to the base
point $x_0$. It is denoted by $\pi_1(X, x_0)$.


It follows from Theorem 51.2 that the operation *, when restricted to this set,
satisfies the axioms for a group. Given two loops f and g based at $x_0$, the product
f * g is always defined and is a loop based at $x_0$. Associativity, the existence of an
identity element $[e_{x_0}]$, and the existence of an inverse $[\bar f]$ for [f] are immediate.

Sometimes this group is called the first homotopy group of X, which term implies
that there is a second homotopy group. There are indeed groups $\pi_n(X, x_0)$ for all
$n \in \mathbb{Z}_+$, but we shall not study them in this book. They are part of the general subject
called homotopy theory.

EXAMPLE 1. Let $\mathbb{R}^n$ denote euclidean n-space. Then $\pi_1(\mathbb{R}^n, x_0)$ is the trivial group (the
group consisting of the identity alone). For if f is a loop in $\mathbb{R}^n$ based at $x_0$, the straight-line
homotopy is a path homotopy between f and the constant path at $x_0$. More generally, if X
is any convex subset of $\mathbb{R}^n$, then $\pi_1(X, x_0)$ is the trivial group. In particular, the unit ball
$B^n$ in $\mathbb{R}^n$,

$$B^n = \{x \mid x_1^2 + \cdots + x_n^2 \le 1\},$$

has trivial fundamental group.
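
The straight-line homotopy used in this example can be written out explicitly. The following is a minimal sketch, assuming points of euclidean space are tuples of floats; the function name `straight_line_homotopy` and the sample loop are hypothetical and serve only as an illustration.

```python
import math


def straight_line_homotopy(f, x0):
    """Return F(s, t) = (1 - t) f(s) + t x0.

    For a loop f based at x0 in a convex subset of R^n, this deforms f
    to the constant loop at x0 while keeping both end points at x0.
    """
    def F(s, t):
        return tuple((1 - t) * u + t * v for u, v in zip(f(s), x0))
    return F


x0 = (1.0, 0.0)


def loop(s):
    """A hypothetical loop in the plane based at x0: once around the unit circle."""
    return (math.cos(2 * math.pi * s), math.sin(2 * math.pi * s))


F = straight_line_homotopy(loop, x0)
```

The intermediate loops F(·, t) pass through the interior of the unit disc, so this particular homotopy lives in the plane (or the disc) and not in the circle itself; convexity of the ambient set is what makes the formula legitimate.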


An immediate question one asks is the extent to which the fundamental group 
depends on the base point. We consider that question now. 


Definition. Let α be a path in X from $x_0$ to $x_1$. We define a map

$$\hat\alpha : \pi_1(X, x_0) \to \pi_1(X, x_1)$$

by the equation

$$\hat\alpha([f]) = [\bar\alpha] * [f] * [\alpha].$$

The map $\hat\alpha$, which we call "α-hat," is well-defined because the operation * is
well-defined. If f is a loop based at $x_0$, then $\bar\alpha * (f * \alpha)$ is a loop based at $x_1$. Hence $\hat\alpha$ maps
$\pi_1(X, x_0)$ into $\pi_1(X, x_1)$, as desired; note that it depends only on the path-homotopy
class of α. It is pictured in Figure 52.1.


Figure 52.1


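Theorem 52.1. The map $\hat\alpha$ is a group isomorphism.


Proof. To show that $\hat\alpha$ is a homomorphism, we compute

$$\begin{aligned}
\hat\alpha([f]) * \hat\alpha([g]) &= ([\bar\alpha] * [f] * [\alpha]) * ([\bar\alpha] * [g] * [\alpha]) \\
&= [\bar\alpha] * [f] * [g] * [\alpha] \\
&= \hat\alpha([f] * [g]).
\end{aligned}$$

To show that $\hat\alpha$ is an isomorphism, we show that if β denotes the path $\bar\alpha$, which is
the reverse of α, then $\hat\beta$ is an inverse for $\hat\alpha$. We compute, for each element [h] of
$\pi_1(X, x_1)$,

$$\hat\beta([h]) = [\bar\beta] * [h] * [\beta] = [\alpha] * [h] * [\bar\alpha],$$
$$\hat\alpha(\hat\beta([h])) = [\bar\alpha] * ([\alpha] * [h] * [\bar\alpha]) * [\alpha] = [h].$$

A similar computation shows that $\hat\beta(\hat\alpha([f])) = [f]$ for each $[f] \in \pi_1(X, x_0)$. ∎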


Corollary 52.2. If X is path connected and $x_0$ and $x_1$ are two points of X, then
$\pi_1(X, x_0)$ is isomorphic to $\pi_1(X, x_1)$.


Suppose that X is a topological space. Let C be the path component of X
containing $x_0$. It is easy to see that $\pi_1(C, x_0) = \pi_1(X, x_0)$, since all loops and homotopies
in X that are based at $x_0$ must lie in the subspace C. Thus $\pi_1(X, x_0)$ depends on only
the path component of X containing $x_0$; it gives us no information whatever about the
rest of X. For this reason, it is usual to deal with only path-connected spaces when
studying the fundamental group.

If X is path connected, all the groups $\pi_1(X, x)$ are isomorphic, so it is tempting
to try to "identify" all these groups with one another and to speak simply of the
fundamental group of the space X, without reference to base point. The difficulty with
this approach is that there is no natural way of identifying $\pi_1(X, x_0)$ with $\pi_1(X, x_1)$;
different paths α and β from $x_0$ to $x_1$ may give rise to different isomorphisms between
these groups. For this reason, omitting the base point can lead to error.
It turns out that the isomorphism of $\pi_1(X, x_0)$ with $\pi_1(X, x_1)$ is independent of
path if and only if the fundamental group is abelian. (See Exercise 3.) This is a
stringent requirement on the space X.


Definition. A space X is said to be simply connected if it is a path-connected space
and if $\pi_1(X, x_0)$ is the trivial (one-element) group for some $x_0 \in X$, and hence for
every $x_0 \in X$. We often express the fact that $\pi_1(X, x_0)$ is the trivial group by writing
$\pi_1(X, x_0) = 0$.


Lemma 52.3. In a simply connected space X, any two paths having the same initial 
and final points are path homotopic. 


Proof. Let α and β be two paths from $x_0$ to $x_1$. Then $\alpha * \bar\beta$ is defined and is a loop
on X based at $x_0$. Since X is simply connected, this loop is path homotopic to the
constant loop at $x_0$. Then

$$[\alpha * \bar\beta] * [\beta] = [e_{x_0}] * [\beta],$$

from which it follows that [α] = [β]. ∎


It is intuitively clear that the fundamental group is a topological invariant of the 
space X. A convenient way to prove this fact formally is to introduce the notion of the 
“homomorphism induced by a continuous map.” 

Suppose that $h : X \to Y$ is a continuous map that carries the point $x_0$ of X to the
point $y_0$ of Y. We often denote this fact by writing

$$h : (X, x_0) \to (Y, y_0).$$

If f is a loop in X based at $x_0$, then the composite $h \circ f : I \to Y$ is a loop in Y based
at $y_0$. The correspondence $f \to h \circ f$ thus gives rise to a map carrying $\pi_1(X, x_0)$ into
$\pi_1(Y, y_0)$. We define it formally as follows:

Definition. Let $h : (X, x_0) \to (Y, y_0)$ be a continuous map. Define

$$h_* : \pi_1(X, x_0) \to \pi_1(Y, y_0)$$

by the equation

$$h_*([f]) = [h \circ f].$$

The map $h_*$ is called the homomorphism induced by h, relative to the base point $x_0$.


The map $h_*$ is well-defined, for if F is a path homotopy between the paths f
and f', then $h \circ F$ is a path homotopy between the paths $h \circ f$ and $h \circ f'$. The fact
that $h_*$ is a homomorphism follows from the equation

$$(h \circ f) * (h \circ g) = h \circ (f * g).$$
The homomorphism $h_*$ depends not only on the map $h : X \to Y$ but also on the choice
of the base point $x_0$. (Once $x_0$ is chosen, $y_0$ is determined by h.) So some notational
difficulty will arise if we want to consider several different base points for X. If $x_0$ and
$x_1$ are two different points of X, we cannot use the same symbol $h_*$ to stand for two
different homomorphisms, one having domain $\pi_1(X, x_0)$ and the other having domain
$\pi_1(X, x_1)$. Even if X is path connected, so these groups are isomorphic, they are still
not the same group. In such a case, we shall use the notation

$$(h_{x_0})_* : \pi_1(X, x_0) \to \pi_1(Y, y_0)$$

for the first homomorphism and $(h_{x_1})_*$ for the second. If there is only one base point
under consideration, we shall omit mention of the base point and denote the induced
homomorphism merely by $h_*$.

The induced homomorphism has two properties that are crucial in the applications.
They are called its "functorial properties" and are given in the following theorem:


Theorem 52.4. If $h : (X, x_0) \to (Y, y_0)$ and $k : (Y, y_0) \to (Z, z_0)$ are continuous,
then $(k \circ h)_* = k_* \circ h_*$. If $i : (X, x_0) \to (X, x_0)$ is the identity map, then $i_*$ is the
identity homomorphism.


Proof. The proof is a triviality. By definition,

$$(k \circ h)_*([f]) = [(k \circ h) \circ f],$$
$$(k_* \circ h_*)([f]) = k_*(h_*([f])) = k_*([h \circ f]) = [k \circ (h \circ f)].$$

Similarly, $i_*([f]) = [i \circ f] = [f]$. ∎


Corollary 52.5. If $h : (X, x_0) \to (Y, y_0)$ is a homeomorphism of X with Y, then $h_*$
is an isomorphism of $\pi_1(X, x_0)$ with $\pi_1(Y, y_0)$.


Proof. Let $k : (Y, y_0) \to (X, x_0)$ be the inverse of h. Then $k_* \circ h_* = (k \circ h)_* = i_*$,
where i is the identity map of $(X, x_0)$; and $h_* \circ k_* = (h \circ k)_* = j_*$, where j is the
identity map of $(Y, y_0)$. Since $i_*$ and $j_*$ are the identity homomorphisms of the groups
$\pi_1(X, x_0)$ and $\pi_1(Y, y_0)$, respectively, $k_*$ is the inverse of $h_*$. ∎


Exercises 


1. A subset A of $\mathbb{R}^n$ is said to be star convex if for some point $a_0$ of A, all the line
segments joining $a_0$ to other points of A lie in A.
(a) Find a star convex set that is not convex.
(b) Show that if A is star convex, A is simply connected.


2. Let α be a path in X from $x_0$ to $x_1$; let β be a path in X from $x_1$ to $x_2$. Show that
if $\gamma = \alpha * \beta$, then $\hat\gamma = \hat\beta \circ \hat\alpha$.
3. Let $x_0$ and $x_1$ be points of the path-connected space X. Show that $\pi_1(X, x_0)$ is
abelian if and only if for every pair α and β of paths from $x_0$ to $x_1$, we have
$\hat\alpha = \hat\beta$.

4. Let $A \subset X$; suppose $r : X \to A$ is a continuous map such that r(a) = a for each
$a \in A$. (The map r is called a retraction of X onto A.) If $a_0 \in A$, show that

$$r_* : \pi_1(X, a_0) \to \pi_1(A, a_0)$$

is surjective.

5. Let A be a subspace of $\mathbb{R}^n$; let $h : (A, a_0) \to (Y, y_0)$. Show that if h is
extendable to a continuous map of $\mathbb{R}^n$ into Y, then $h_*$ is the trivial homomorphism (the
homomorphism that maps everything to the identity element).

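6. Show that if X is path connected, the homomorphism induced by a continuous
map is independent of base point, up to isomorphisms of the groups involved.
More precisely, let $h : X \to Y$ be continuous, with $h(x_0) = y_0$ and $h(x_1) = y_1$.
Let α be a path in X from $x_0$ to $x_1$, and let $\beta = h \circ \alpha$. Show that

$$\hat\beta \circ (h_{x_0})_* = (h_{x_1})_* \circ \hat\alpha.$$

This equation expresses the fact that the following diagram of maps "commutes."

$$\begin{array}{ccc}
\pi_1(X, x_0) & \xrightarrow{\;(h_{x_0})_*\;} & \pi_1(Y, y_0) \\
\downarrow{\scriptstyle\hat\alpha} & & \downarrow{\scriptstyle\hat\beta} \\
\pi_1(X, x_1) & \xrightarrow{\;(h_{x_1})_*\;} & \pi_1(Y, y_1)
\end{array}$$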


7. Let G be a topological group with operation · and identity element $x_0$. Let
$\Omega(G, x_0)$ denote the set of all loops in G based at $x_0$. If $f, g \in \Omega(G, x_0)$,
let us define a loop $f \otimes g$ by the rule

$$(f \otimes g)(s) = f(s) \cdot g(s).$$

(a) Show that this operation makes the set $\Omega(G, x_0)$ into a group.

(b) Show that this operation induces a group operation ⊗ on $\pi_1(G, x_0)$.

(c) Show that the two group operations * and ⊗ on $\pi_1(G, x_0)$ are the same.
[Hint: Compute $(f * e_{x_0}) \otimes (e_{x_0} * g)$.]

(d) Show that $\pi_1(G, x_0)$ is abelian.
We have shown that any convex subspace of $\mathbb{R}^n$ has a trivial fundamental group; we
turn now to the task of computing some fundamental groups that are not trivial. One
of the most useful tools for this purpose is the notion of covering space, which we
introduce in this section. Covering spaces are also important in the study of Riemann
surfaces and complex manifolds. (See [A-S].) We shall study them in more detail in
Chapter 13.
Definition. Let $p : E \to B$ be a continuous surjective map. The open set U of B
is said to be evenly covered by p if the inverse image $p^{-1}(U)$ can be written as the
union of disjoint open sets $V_\alpha$ in E such that for each α, the restriction of p to $V_\alpha$
is a homeomorphism of $V_\alpha$ onto U. The collection $\{V_\alpha\}$ will be called a partition of
$p^{-1}(U)$ into slices.


If U is an open set that is evenly covered by p, we often picture the set $p^{-1}(U)$
as a "stack of pancakes," each having the same size and shape as U, floating in the air
above U; the map p squashes them all down onto U. See Figure 53.1. Note that if U
is evenly covered by p and W is an open set contained in U, then W is also evenly
covered by p.


Figure 53.1


Definition. Let $p : E \to B$ be continuous and surjective. If every point b of B has a
neighborhood U that is evenly covered by p, then p is called a covering map, and E
is said to be a covering space of B.


Note that if $p : E \to B$ is a covering map, then for each $b \in B$ the
subspace $p^{-1}(b)$ of E has the discrete topology. For each slice $V_\alpha$ is open in E and
intersects the set $p^{-1}(b)$ in a single point; therefore, this point is open in $p^{-1}(b)$.

Note also that if $p : E \to B$ is a covering map, then p is an open map. For
suppose A is an open set of E. Given $x \in p(A)$, choose a neighborhood U of x that is
evenly covered by p. Let $\{V_\alpha\}$ be a partition of $p^{-1}(U)$ into slices. There is a point y
of A such that p(y) = x; let $V_\beta$ be the slice containing y. The set $V_\beta \cap A$ is open
in E and hence open in $V_\beta$; because p maps $V_\beta$ homeomorphically onto U, the set
$p(V_\beta \cap A)$ is open in U and hence open in B; it is thus a neighborhood of x contained
in p(A), as desired.
EXAMPLE 1. Let X be any space; let $i : X \to X$ be the identity map. Then i is a
covering map (of the most trivial sort). More generally, let E be the space $X \times \{1, \ldots, n\}$
consisting of n disjoint copies of X. The map $p : E \to X$ given by p(x, i) = x for all i
is again a (rather trivial) covering map. In this case, we can picture the entire space E as a
stack of pancakes over X.


In practice, one often restricts oneself to covering spaces that are path connected,
to eliminate trivial coverings of the pancake-stack variety. An example of such a
nontrivial covering space is the following:


Theorem 53.1. The map $p : \mathbb{R} \to S^1$ given by the equation

$$p(x) = (\cos 2\pi x, \sin 2\pi x)$$

is a covering map.

One can picture p as a function that wraps the real line $\mathbb{R}$ around the circle $S^1$, and
in the process maps each interval [n, n + 1] onto $S^1$.

Proof. The fact that p is a covering map comes from elementary properties of the sine
and cosine functions. Consider, for example, the subset U of $S^1$ consisting of those
points having positive first coordinate. The set $p^{-1}(U)$ consists of those points x for
which $\cos 2\pi x$ is positive; that is, it is the union of the intervals

$$V_n = (n - \tfrac14, n + \tfrac14),$$

for all $n \in \mathbb{Z}$. See Figure 53.2. Now, restricted to any closed interval $\bar V_n$, the map p
is injective because $\sin 2\pi x$ is strictly monotonic on such an interval. Furthermore,
p carries $\bar V_n$ surjectively onto $\bar U$, and $V_n$ to U, by the intermediate value theorem.
Since $\bar V_n$ is compact, $p|\bar V_n$ is a homeomorphism of $\bar V_n$ with $\bar U$. In particular, $p|V_n$ is a
homeomorphism of $V_n$ with U.


Figure 53.2


Similar arguments can be applied to the intersections of $S^1$ with the upper and
lower open half-planes, and with the open left-hand half-plane. These open sets
cover $S^1$, and each of them is evenly covered by p. Hence $p : \mathbb{R} \to S^1$ is a
covering map. ∎
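
The slices in this proof can be made concrete. The sketch below is an illustration under stated assumptions, not part of the proof: it takes U to be the open right half of the circle, and the hypothetical helper `slice_inverse(n)` returns a formula for the inverse of p restricted to the slice (n − 1/4, n + 1/4), using the arcsine of the second coordinate.

```python
import math


def p(x):
    """The map p(x) = (cos 2*pi*x, sin 2*pi*x) from the real line onto the circle."""
    return (math.cos(2 * math.pi * x), math.sin(2 * math.pi * x))


def slice_inverse(n):
    """Return the inverse of p restricted to V_n = (n - 1/4, n + 1/4).

    The returned function is defined on U, the points of the circle with
    positive first coordinate. On V_n the second coordinate sin(2*pi*x)
    is strictly increasing, so arcsine recovers x - n.
    """
    def q(point):
        u, v = point
        if u <= 0:
            raise ValueError("point does not lie in U")
        return n + math.asin(v) / (2 * math.pi)
    return q
```

For each integer n the composite `slice_inverse(n)(p(x))` returns x when x lies in the slice V_n, so every point of U has exactly one preimage in each slice.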


If $p : E \to B$ is a covering map, then p is a local homeomorphism of E with B.
That is, each point e of E has a neighborhood that is mapped homeomorphically by p
onto an open subset of B. The condition that p be a local homeomorphism does not
suffice, however, to ensure that p is a covering map, as the following example shows.


EXAMPLE 2. The map $p : \mathbb{R}_+ \to S^1$ given by the equation

$$p(x) = (\cos 2\pi x, \sin 2\pi x)$$

is surjective, and it is a local homeomorphism. See Figure 53.3. But it is not a covering
map, for the point $b_0 = (1, 0)$ has no neighborhood U that is evenly covered by p. The
typical neighborhood U of $b_0$ has an inverse image consisting of small neighborhoods $V_n$
of each integer n for n > 0, along with a small interval $V_0$ of the form (0, ε). Each of the
intervals $V_n$ for n > 0 is mapped homeomorphically onto U by the map p, but the interval
$V_0$ is only imbedded in U by p.


Figure 53.3

EXAMPLE 3. The preceding example might lead you to think that the real line $\mathbb{R}$ is the
only connected covering space of the circle $S^1$. This is not so. Consider, for example, the
map $p : S^1 \to S^1$ given in equations by

$$p(z) = z^2.$$

[Here we consider $S^1$ as the subset of the complex plane $\mathbb{C}$ consisting of those complex
numbers z with |z| = 1.] We leave it to you to check that p is a covering map.
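
As a partial sanity check (not a proof that p is a covering map), one can compute the fiber of the squaring map numerically. The sketch below assumes points of the circle are represented as Python complex numbers of modulus 1; the helper name `fiber` is hypothetical.

```python
import cmath


def p(z):
    """The squaring map on the unit circle in the complex plane."""
    return z * z


def fiber(w):
    """Return the two points z of the unit circle with z**2 == w (up to rounding)."""
    root = cmath.exp(0.5j * cmath.phase(w))
    return (root, -root)
```

Each point w of the circle has exactly two preimages, and they are antipodal; this is consistent with the picture of the circle wrapped twice around itself.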


Example 2 shows that the map obtained by restricting a covering map may not be 
a covering map. Here is one situation where it will be a covering map: 


Theorem 53.2. Let $p : E \to B$ be a covering map. If $B_0$ is a subspace of B, and if
$E_0 = p^{-1}(B_0)$, then the map $p_0 : E_0 \to B_0$ obtained by restricting p is a covering
map.
Proof. Given $b_0 \in B_0$, let U be an open set in B containing $b_0$ that is evenly covered
by p; let $\{V_\alpha\}$ be a partition of $p^{-1}(U)$ into slices. Then $U \cap B_0$ is a neighborhood of $b_0$
in $B_0$, and the sets $V_\alpha \cap E_0$ are disjoint open sets in $E_0$ whose union is $p^{-1}(U \cap B_0)$,
and each is mapped homeomorphically onto $U \cap B_0$ by p. ∎


Theorem 53.3. If $p : E \to B$ and $p' : E' \to B'$ are covering maps, then

$$p \times p' : E \times E' \to B \times B'$$

is a covering map.


Proof. Given $b \in B$ and $b' \in B'$, let U and U' be neighborhoods of b and b',
respectively, that are evenly covered by p and p', respectively. Let $\{V_\alpha\}$ and $\{V'_\beta\}$ be
partitions of $p^{-1}(U)$ and $(p')^{-1}(U')$, respectively, into slices. Then the inverse image
under $p \times p'$ of the open set $U \times U'$ is the union of all the sets $V_\alpha \times V'_\beta$. These are
disjoint open sets of $E \times E'$, and each is mapped homeomorphically onto $U \times U'$ by
$p \times p'$. ∎


EXAMPLE 4. Consider the space $T = S^1 \times S^1$; it is called the torus. The product map

$$p \times p : \mathbb{R} \times \mathbb{R} \to S^1 \times S^1$$

is a covering of the torus by the plane $\mathbb{R}^2$, where p denotes the covering map of
Theorem 53.1. Each of the unit squares $[n, n+1] \times [m, m+1]$ gets wrapped by $p \times p$ entirely
around the torus. See Figure 53.4.


Figure 53.4


In this figure, we have pictured the torus not as the product $S^1 \times S^1$, which is a subspace
of $\mathbb{R}^4$ and thus difficult to visualize, but as the familiar doughnut-shaped surface D in $\mathbb{R}^3$
obtained by rotating the circle $C_1$ in the xz-plane of radius 1/3 centered at (1, 0, 0) about
the z-axis. It is not hard to see that $S^1 \times S^1$ is homeomorphic with the surface D. Let $C_2$
be the circle of radius 1 in the xy-plane centered at the origin. Then let us map $C_1 \times C_2$
into D by defining $f(a \times b)$ to be that point into which a is carried when one rotates the
circle $C_1$ about the z-axis until its center hits the point b. See Figure 53.5. The map f
will be a homeomorphism of $C_1 \times C_2$ with D, as you can check mentally. If you wish,
you can write equations for f and check continuity, injectivity, and surjectivity directly.
(Continuity of $f^{-1}$ will follow from compactness of $C_1 \times C_2$.)


Figure 53.5


EXAMPLE 5. Consider the covering map $p \times p$ of the preceding example. Let $b_0$ denote
the point p(0) of $S^1$, and let $B_0$ denote the subspace

$$B_0 = (S^1 \times b_0) \cup (b_0 \times S^1)$$

of $S^1 \times S^1$. Then $B_0$ is the union of two circles that have a point in common; we sometimes
call it the figure-eight space. The space $E_0 = p^{-1}(B_0)$ is the "infinite grid"

$$E_0 = (\mathbb{R} \times \mathbb{Z}) \cup (\mathbb{Z} \times \mathbb{R})$$

pictured in Figure 53.4. The map $p_0 : E_0 \to B_0$ obtained by restricting $p \times p$ is thus a
covering map.


The infinite grid is but one covering space of the figure eight; we shall see others later
on.


EXAMPLE 6. Consider the covering map

$$p \times i : \mathbb{R} \times \mathbb{R}_+ \to S^1 \times \mathbb{R}_+,$$

where i is the identity map of $\mathbb{R}_+$ and p is the map of Theorem 53.1. If we take the standard
homeomorphism of $S^1 \times \mathbb{R}_+$ with $\mathbb{R}^2 - \mathbf{0}$, sending $x \times t$ to tx, the composite gives us a
covering

$$\mathbb{R} \times \mathbb{R}_+ \to \mathbb{R}^2 - \mathbf{0}$$

of the punctured plane by the open upper half-plane. It is pictured in Figure 53.6. This
covering map appears in the study of complex variables as the Riemann surface corresponding
to the complex logarithm function.
Exercises

1. Let Y have the discrete topology. Show that if $p : X \times Y \to X$ is projection on
the first coordinate, then p is a covering map.

2. Let $p : E \to B$ be continuous and surjective. Suppose that U is an open set of B
that is evenly covered by p. Show that if U is connected, then the partition of
$p^{-1}(U)$ into slices is unique.

3. Let $p : E \to B$ be a covering map; let B be connected. Show that if $p^{-1}(b_0)$
has k elements for some $b_0 \in B$, then $p^{-1}(b)$ has k elements for every $b \in B$.
In such a case, E is called a k-fold covering of B.

4. Let $q : X \to Y$ and $r : Y \to Z$ be covering maps; let $p = r \circ q$. Show that if
$r^{-1}(z)$ is finite for each $z \in Z$, then p is a covering map.

5. Show that the map of Example 3 is a covering map. Generalize to the map
$p(z) = z^n$.

6. Let $p : E \to B$ be a covering map.

(a) If B is Hausdorff, regular, completely regular, or locally compact Hausdorff,
then so is E. (Hint: If $\{V_\alpha\}$ is a partition of $p^{-1}(U)$ into slices, and C is a
closed set of B such that $C \subset U$, then $p^{-1}(C) \cap V_\alpha$ is a closed set of E.)

(b) If B is compact and $p^{-1}(b)$ is finite for each $b \in B$, then E is compact.
The study of covering spaces of a space X is intimately related to the study of the
fundamental group of X. In this section, we establish the crucial links between the
two concepts, and compute the fundamental group of the circle.
Definition. Let $p : E \to B$ be a map. If f is a continuous mapping of some space X
into B, a lifting of f is a map $\tilde f : X \to E$ such that $p \circ \tilde f = f$.

$$\begin{array}{ccc}
 & & E \\
 & {\scriptstyle\tilde f}\nearrow & \downarrow{\scriptstyle p} \\
X & \xrightarrow{\;f\;} & B
\end{array}$$

The existence of liftings when p is a covering map is an important tool in studying
covering spaces and the fundamental group. First, we show that for a covering space,
paths can be lifted; then we show that path homotopies can be lifted as well. First, an
example:

EXAMPLE 1. Consider the covering $p : \mathbb{R} \to S^1$ of Theorem 53.1. The path
$f : [0, 1] \to S^1$ beginning at $b_0 = (1, 0)$ given by $f(s) = (\cos \pi s, \sin \pi s)$ lifts to the path
$\tilde f(s) = s/2$ beginning at 0 and ending at 1/2. The path $g(s) = (\cos \pi s, -\sin \pi s)$ lifts to the
path $\tilde g(s) = -s/2$ beginning at 0 and ending at −1/2. The path $h(s) = (\cos 4\pi s, \sin 4\pi s)$
lifts to the path $\tilde h(s) = 2s$ beginning at 0 and ending at 2. Intuitively, h wraps the interval
[0, 1] around the circle twice; this is reflected in the fact that the lifted path $\tilde h$ begins at zero
and ends at the number 2. These paths are pictured in Figure 54.1.
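
The three liftings in this example can be reproduced numerically. The following is a rough sketch, assuming the covering $p(x) = (\cos 2\pi x, \sin 2\pi x)$ and a path sampled finely enough that consecutive samples are less than half a turn apart; the function name `lift` and the step count are hypothetical choices. At each step it adds the small signed angle between consecutive samples, which amounts to staying in the slice that contains the previous lifted point.

```python
import math


def p(x):
    """The covering map of the circle by the real line."""
    return (math.cos(2 * math.pi * x), math.sin(2 * math.pi * x))


def lift(path, start=0.0, steps=1000):
    """Return samples of a lifting of `path` through p that begins at `start`.

    Assumes p(start) == path(0) and that consecutive samples of the path
    subtend an angle of absolute value less than pi.
    """
    x = start
    lifted = [x]
    prev = path(0.0)
    for j in range(1, steps + 1):
        cur = path(j / steps)
        # Signed angle from prev to cur, computed from cross and dot products.
        cross = prev[0] * cur[1] - prev[1] * cur[0]
        dot = prev[0] * cur[0] + prev[1] * cur[1]
        x += math.atan2(cross, dot) / (2 * math.pi)
        lifted.append(x)
        prev = cur
    return lifted


def f(s):
    return (math.cos(math.pi * s), math.sin(math.pi * s))


def g(s):
    return (math.cos(math.pi * s), -math.sin(math.pi * s))


def h(s):
    return (math.cos(4 * math.pi * s), math.sin(4 * math.pi * s))
```

With these definitions the last samples of `lift(f)`, `lift(g)`, and `lift(h)` are approximately 1/2, −1/2, and 2, in agreement with the end points stated in the example.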


Figure 54.1


Lemma 54.1. Let $p : E \to B$ be a covering map, let $p(e_0) = b_0$. Any path
$f : [0, 1] \to B$ beginning at $b_0$ has a unique lifting to a path $\tilde f$ in E beginning at $e_0$.


Proof. Cover B by open sets U each of which is evenly covered by p. Find a
subdivision of [0, 1], say $s_0, \ldots, s_n$, such that for each i the set $f([s_i, s_{i+1}])$ lies in such an
open set U. (Here we use the Lebesgue number lemma.) We define the lifting $\tilde f$ step
by step.


First, define $\tilde f(0) = e_0$. Then, supposing $\tilde f(s)$ is defined for $0 \le s \le s_i$, we define
$\tilde f$ on $[s_i, s_{i+1}]$ as follows: The set $f([s_i, s_{i+1}])$ lies in some open set U that is evenly
covered by p. Let $\{V_\alpha\}$ be a partition of $p^{-1}(U)$ into slices; each set $V_\alpha$ is mapped
homeomorphically onto U by p. Now $\tilde f(s_i)$ lies in one of these sets, say in $V_0$. Define
$\tilde f(s)$ for $s \in [s_i, s_{i+1}]$ by the equation

$$\tilde f(s) = (p|V_0)^{-1}(f(s)).$$

§54 The Fundamental Group of the Circle 343 


Because $p|V_0 : V_0 \to U$ is a homeomorphism, $\tilde f$ will be continuous on $[s_i, s_{i+1}]$.
Continuing in this way, we define $\tilde f$ on all of [0, 1]. Continuity of $\tilde f$ follows from
the pasting lemma; the fact that $p \circ \tilde f = f$ is immediate from the definition of $\tilde f$.
The uniqueness of $\tilde f$ is also proved step by step. Suppose that $\tilde{\tilde f}$ is another lifting
of f beginning at $e_0$. Then $\tilde{\tilde f}(0) = e_0 = \tilde f(0)$. Suppose that $\tilde{\tilde f}(s) = \tilde f(s)$ for all s
such that $0 \le s \le s_i$. Let $V_0$ be as in the preceding paragraph; then for $s \in [s_i, s_{i+1}]$,
$\tilde f(s)$ is defined as $(p|V_0)^{-1}(f(s))$. What can $\tilde{\tilde f}(s)$ equal? Since $\tilde{\tilde f}$ is a lifting of f,
it must carry the interval $[s_i, s_{i+1}]$ into the set $p^{-1}(U) = \bigcup V_\alpha$. The slices $V_\alpha$ are
open and disjoint; because the set $\tilde{\tilde f}([s_i, s_{i+1}])$ is connected, it must lie entirely in one
of the sets $V_\alpha$. Because $\tilde{\tilde f}(s_i) = \tilde f(s_i)$, which is in $V_0$, $\tilde{\tilde f}$ must carry all of $[s_i, s_{i+1}]$
into the set $V_0$. Thus, for s in $[s_i, s_{i+1}]$, $\tilde{\tilde f}(s)$ must equal some point y of $V_0$ lying
in $p^{-1}(f(s))$. But there is only one such point y, namely, $(p|V_0)^{-1}(f(s))$. Hence
$\tilde{\tilde f}(s) = \tilde f(s)$ for $s \in [s_i, s_{i+1}]$. ∎


Lemma 54.2. Let $p : E \to B$ be a covering map; let $p(e_0) = b_0$. Let the map
$F : I \times I \to B$ be continuous, with $F(0, 0) = b_0$. There is a unique lifting of F to a
continuous map

$$\tilde F : I \times I \to E$$

such that $\tilde F(0, 0) = e_0$. If F is a path homotopy, then $\tilde F$ is a path homotopy.


Proof. Given F, we first define $\tilde F(0, 0) = e_0$. Next, we use the preceding lemma to
extend $\tilde F$ to the left-hand edge $0 \times I$ and the bottom edge $I \times 0$ of $I \times I$. Then we
extend $\tilde F$ to all of $I \times I$ as follows:

Choose subdivisions

$$s_0 < s_1 < \cdots < s_m,$$
$$t_0 < t_1 < \cdots < t_n$$

of I fine enough that each rectangle

$$I_i \times J_j = [s_{i-1}, s_i] \times [t_{j-1}, t_j]$$

is mapped by F into an open set of B that is evenly covered by p. (Use the Lebesgue
number lemma.) We define the lifting $\tilde F$ step by step, beginning with the rectangle
$I_1 \times J_1$, continuing with the other rectangles $I_i \times J_1$ in the "bottom row," then with the
rectangles $I_i \times J_2$ in the next row, and so on.

In general, given ig and jo, assume that F is defined on the set A which is the 
union of 0 x 7 and Z x 0 and all the rectangles “previous” to Jj, x J), (those rectangles 
l; x Jj for which j < jo and those for which j = jo andi < ig). Assume also that F 
is a continuous lifting of F|A. We define F on Lo X Ji- Choose an open set U of B 
that is evenly covered by p and contains the set F(Z) x Jjy). Let {Va} be a partition 
of p~'(U) into slices; each set Vy is mapped homeomorphically onto U by p. Now 
F is already defined on the set C = A N (lig X Jj). This set is the union of the left 
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and bottom edges of the rectangle Jj, x Jj), so it is connected. Therefore, F(C) is 
connected and must lie entirely within one of the sets Vg. Suppose it lies in Vọ Then, 
the situation is as pictured in Figure 54.2. 


Figure 54.2 


Let $p_0 : V_0 \to U$ denote the restriction of $p$ to $V_0$. Since $\tilde F$ is a lifting of $F|A$, we 
know that for $x \in C$, 

$$p_0(\tilde F(x)) = p(\tilde F(x)) = F(x),$$

so that $\tilde F(x) = p_0^{-1}(F(x))$. Hence we may extend $\tilde F$ by defining 

$$\tilde F(x) = p_0^{-1}(F(x))$$

for $x \in I_{i_0} \times J_{j_0}$. The extended map will be continuous by the pasting lemma. 

Continuing in this way, we define $\tilde F$ on all of $I^2$. 

To check uniqueness, note that at each step of the construction of $\tilde F$, as we extend 
$\tilde F$ first to the bottom and left edges of $I^2$, and then to the rectangles $I_i \times J_j$, one 
by one, there is only one way to extend $\tilde F$ continuously. Thus, once the value of $\tilde F$ at 
$(0, 0)$ is specified, $\tilde F$ is completely determined. 

Now suppose that $F$ is a path homotopy. We wish to show that $\tilde F$ is a path homotopy. 
The map $F$ carries the entire left edge $0 \times I$ of $I^2$ into a single point $b_0$ of $B$. 
Because $\tilde F$ is a lifting of $F$, it carries this edge into the set $p^{-1}(b_0)$. But this set has the 
discrete topology as a subspace of $E$. Since $0 \times I$ is connected and $\tilde F$ is continuous, 
$\tilde F(0 \times I)$ is connected and thus must equal a one-point set. Similarly, $\tilde F(1 \times I)$ must 
be a one-point set. Thus $\tilde F$ is a path homotopy. ∎


Theorem 54.3. Let $p : E \to B$ be a covering map; let $p(e_0) = b_0$. Let $f$ and $g$ 
be two paths in $B$ from $b_0$ to $b_1$; let $\tilde f$ and $\tilde g$ be their respective liftings to paths in $E$ 
beginning at $e_0$. If $f$ and $g$ are path homotopic, then $\tilde f$ and $\tilde g$ end at the same point of 
$E$ and are path homotopic. 

Proof. Let $F : I \times I \to B$ be the path homotopy between $f$ and $g$. Then $F(0, 0) = b_0$. 
Let $\tilde F : I \times I \to E$ be the lifting of $F$ to $E$ such that $\tilde F(0, 0) = e_0$. By the 
preceding lemma, $\tilde F$ is a path homotopy, so that $\tilde F(0 \times I) = \{e_0\}$ and $\tilde F(1 \times I)$ is a 
one-point set $\{e_1\}$. 

The restriction $\tilde F|I \times 0$ of $\tilde F$ to the bottom edge of $I \times I$ is a path on $E$ beginning at 
$e_0$ that is a lifting of $F|I \times 0$. By uniqueness of path liftings, we must have $\tilde F(s, 0) = \tilde f(s)$. 
Similarly, $\tilde F|I \times 1$ is a path on $E$ that is a lifting of $F|I \times 1$, and it begins at $e_0$ 
because $\tilde F(0 \times I) = \{e_0\}$. By uniqueness of path liftings, $\tilde F(s, 1) = \tilde g(s)$. Therefore, 
both $\tilde f$ and $\tilde g$ end at $e_1$, and $\tilde F$ is a path homotopy between them. ∎


Definition. Let $p : E \to B$ be a covering map; let $b_0 \in B$. Choose $e_0$ so that 
$p(e_0) = b_0$. Given an element $[f]$ of $\pi_1(B, b_0)$, let $\tilde f$ be the lifting of $f$ to a path in $E$ 
that begins at $e_0$. Let $\phi([f])$ denote the end point $\tilde f(1)$ of $\tilde f$. Then $\phi$ is a well-defined 
set map 

$$\phi : \pi_1(B, b_0) \to p^{-1}(b_0).$$

We call $\phi$ the lifting correspondence derived from the covering map $p$. It depends of 
course on the choice of the point $e_0$. 


Theorem 54.4. Let $p : E \to B$ be a covering map; let $p(e_0) = b_0$. If $E$ is 
path connected, then the lifting correspondence 

$$\phi : \pi_1(B, b_0) \to p^{-1}(b_0)$$

is surjective. If $E$ is simply connected, it is bijective. 


Proof. If $E$ is path connected, then, given $e_1 \in p^{-1}(b_0)$, there is a path $\tilde f$ in $E$ from 
$e_0$ to $e_1$. Then $f = p \circ \tilde f$ is a loop in $B$ at $b_0$, and $\phi([f]) = e_1$ by definition. 

Suppose $E$ is simply connected. Let $[f]$ and $[g]$ be two elements of $\pi_1(B, b_0)$ 
such that $\phi([f]) = \phi([g])$. Let $\tilde f$ and $\tilde g$ be the liftings of $f$ and $g$, respectively, to 
paths in $E$ that begin at $e_0$; then $\tilde f(1) = \tilde g(1)$. Since $E$ is simply connected, there is a 
path homotopy $\tilde F$ in $E$ between $\tilde f$ and $\tilde g$. Then $p \circ \tilde F$ is a path homotopy in $B$ between 
$f$ and $g$. ∎


Theorem 54.5. The fundamental group of $S^1$ is isomorphic to the additive group of 
integers. 


Proof. Let $p : \mathbb{R} \to S^1$ be the covering map of Theorem 53.1, let $e_0 = 0$, and let 
$b_0 = p(e_0)$. Then $p^{-1}(b_0)$ is the set $\mathbb{Z}$ of integers. Since $\mathbb{R}$ is simply connected, the 
lifting correspondence 

$$\phi : \pi_1(S^1, b_0) \to \mathbb{Z}$$

is bijective. We show that $\phi$ is a homomorphism, and the theorem is proved. 




Given $[f]$ and $[g]$ in $\pi_1(B, b_0)$, let $\tilde f$ and $\tilde g$ be their respective liftings to paths 
on $\mathbb{R}$ beginning at 0. Let $n = \tilde f(1)$ and $m = \tilde g(1)$; then $\phi([f]) = n$ and $\phi([g]) = m$, 
by definition. Let $\tilde{\tilde g}$ be the path 

$$\tilde{\tilde g}(s) = n + \tilde g(s)$$

on $\mathbb{R}$. Because $p(n + x) = p(x)$ for all $x \in \mathbb{R}$, the path $\tilde{\tilde g}$ is a lifting of $g$; it begins 
at $n$. Then the product $\tilde f * \tilde{\tilde g}$ is defined, and it is the lifting of $f * g$ that begins at 0, as 
you can check. The end point of this path is $\tilde{\tilde g}(1) = n + m$. Then by definition, 

$$\phi([f] * [g]) = n + m = \phi([f]) + \phi([g]).$$ ∎
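
The lifting correspondence for the covering map $p(x) = (\cos 2\pi x, \sin 2\pi x)$ can be explored numerically. The following Python sketch is an illustration only and plays no role in the proof. It approximates the end point of the lift of a sampled loop by accumulating small signed angles, under the assumption that consecutive samples are less than half a turn apart. The names `lift_endpoint`, `standard_loop`, and `product` are hypothetical helpers introduced for this example.

```python
import math

def lift_endpoint(loop, samples=2000):
    """Approximate the end point of the lift (starting at 0) of a loop in S^1.

    `loop` maps [0, 1] to points (x, y) of the unit circle. The lift is taken
    through p(x) = (cos 2*pi*x, sin 2*pi*x), so its end point is an integer.
    """
    total = 0.0
    x_prev, y_prev = loop(0.0)
    for k in range(1, samples + 1):
        x, y = loop(k / samples)
        # Signed angle from the previous sample to the current one.
        total += math.atan2(x_prev * y - y_prev * x, x_prev * x + y_prev * y)
        x_prev, y_prev = x, y
    return round(total / (2 * math.pi))

def standard_loop(n):
    """The loop s -> (cos 2*pi*n*s, sin 2*pi*n*s), whose lift from 0 is s -> n*s."""
    return lambda s: (math.cos(2 * math.pi * n * s), math.sin(2 * math.pi * n * s))

def product(f, g):
    """The product f * g of two loops: f on [0, 1/2], then g on [1/2, 1]."""
    return lambda s: f(2 * s) if s <= 0.5 else g(2 * s - 1)
```

For instance, `lift_endpoint(product(standard_loop(3), standard_loop(-5)))` should agree with `3 + (-5)`, which is the homomorphism property of $\phi$ checked on one pair of loops.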


Definition. Let $G$ be a group; let $x$ be an element of $G$. We denote the inverse of $x$ 
by $x^{-1}$. The symbol $x^n$ denotes the $n$-fold product of $x$ with itself, $x^{-n}$ denotes the 
$n$-fold product of $x^{-1}$ with itself, and $x^0$ denotes the identity element of $G$. If the set 
of all elements of the form $x^m$, for $m \in \mathbb{Z}$, equals $G$, then $G$ is said to be a cyclic 
group, and $x$ is said to be a generator of $G$. 


The cardinality of a group is also called the order of the group. A group is cyclic of 
infinite order if and only if it is isomorphic to the additive group of integers; it is cyclic 
of order k if and only if it is isomorphic to the group Z/k of integers modulo k. The 
preceding theorem implies that the fundamental group of the circle is infinite cyclic. 

Note that if $x$ is a generator of the infinite cyclic group $G$, and if $y$ is an element 
of the arbitrary group $H$, then there is a unique homomorphism $h$ of $G$ into $H$ such 
that $h(x) = y$; it is defined by setting $h(x^n) = y^n$ for all $n$. 

For later use, in §65 and in Chapters 13 and 14, we prove here a strengthened 
version of Theorem 54.4. 


*Theorem 54.6. Let $p : E \to B$ be a covering map; let $p(e_0) = b_0$. 
(a) The homomorphism $p_* : \pi_1(E, e_0) \to \pi_1(B, b_0)$ is a monomorphism. 
(b) Let $H = p_*(\pi_1(E, e_0))$. The lifting correspondence $\phi$ induces an injective map 

$$\Phi : \pi_1(B, b_0)/H \to p^{-1}(b_0)$$

of the collection of right cosets of $H$ into $p^{-1}(b_0)$, which is bijective if $E$ is path 
connected. 

(c) If $f$ is a loop in $B$ based at $b_0$, then $[f] \in H$ if and only if $f$ lifts to a loop in $E$ 
based at $e_0$. 


Proof. (a) Suppose $\tilde h$ is a loop in $E$ at $e_0$, and $p_*([\tilde h])$ is the identity element. Let $F$ 
be a path homotopy between $p \circ \tilde h$ and the constant loop. If $\tilde F$ is the lifting of $F$ to $E$ 
such that $\tilde F(0, 0) = e_0$, then $\tilde F$ is a path homotopy between $\tilde h$ and the constant loop 
at $e_0$. 

(b) Given loops $f$ and $g$ in $B$, let $\tilde f$ and $\tilde g$ be liftings of them to $E$ that begin at $e_0$. 
Then $\phi([f]) = \tilde f(1)$ and $\phi([g]) = \tilde g(1)$. We show that $\phi([f]) = \phi([g])$ if and only 
if $[f] \in H * [g]$. 

First, suppose that $[f] \in H * [g]$. Then $[f] = [h * g]$, where $h = p \circ \tilde h$ for some 
loop $\tilde h$ in $E$ based at $e_0$. Now the product $\tilde h * \tilde g$ is defined, and it is a lifting of $h * g$. 
Because $[f] = [h * g]$, the liftings $\tilde f$ and $\tilde h * \tilde g$, which begin at $e_0$, must end at the 
same point of $E$. Then $\tilde f$ and $\tilde g$ end at the same point of $E$, so that $\phi([f]) = \phi([g])$. 
See Figure 54.3. 


Figure 54.3 


Now suppose that $\phi([f]) = \phi([g])$. Then $\tilde f$ and $\tilde g$ end at the same point of $E$. 
The product of $\tilde f$ and the reverse of $\tilde g$ is defined, and it is a loop $\tilde h$ in $E$ based at $e_0$. 
By direct computation, $[\tilde h * \tilde g] = [\tilde f]$. If $\tilde F$ is a path homotopy in $E$ between the paths 
$\tilde h * \tilde g$ and $\tilde f$, then $p \circ \tilde F$ is a path homotopy in $B$ between $h * g$ and $f$, where $h = p \circ \tilde h$. 
Thus $[f] \in H * [g]$, as desired. 

If $E$ is path connected, then $\phi$ is surjective, so that $\Phi$ is surjective as well. 

(c) Injectivity of $\Phi$ means that $\phi([f]) = \phi([g])$ if and only if $[f] \in H * [g]$. 
Applying this result in the case where $g$ is the constant loop, we see that $\phi([f]) = e_0$ 
if and only if $[f] \in H$. But $\phi([f]) = e_0$ precisely when the lift of $f$ that begins at $e_0$ 
also ends at $e_0$. ∎


Exercises 


1. What goes wrong with the “path-lifting lemma” (Lemma 54.1) for the local 
homeomorphism of Example 2 of §53? 


2. In defining the map F in the proof of Lemma 54.2, why were we so careful about 
the order in which we considered the small rectangles? 


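3. Let $p : E \to B$ be a covering map. Let $\alpha$ and $\beta$ be paths in $B$ with $\alpha(1) = \beta(0)$; 
let $\tilde\alpha$ and $\tilde\beta$ be liftings of them such that $\tilde\alpha(1) = \tilde\beta(0)$. Show that $\tilde\alpha * \tilde\beta$ is a lifting 
of $\alpha * \beta$. 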




4. Consider the covering map $p : \mathbb{R} \times \mathbb{R}_+ \to \mathbb{R}^2 - 0$ of Example 6 of §53. Find 
liftings of the paths 

$$f(t) = (2 - t, 0),$$
$$g(t) = ((1 + t)\cos 2\pi t, (1 + t)\sin 2\pi t),$$
$$h = f * g.$$

Sketch these paths and their liftings. 
5. Consider the covering map $p \times p : \mathbb{R} \times \mathbb{R} \to S^1 \times S^1$ of Example 4 of §53. 
Consider the path 

$$f(t) = (\cos 2\pi t, \sin 2\pi t) \times (\cos 4\pi t, \sin 4\pi t)$$

in $S^1 \times S^1$. Sketch what $f$ looks like when $S^1 \times S^1$ is identified with the doughnut 
surface $D$. Find a lifting $\tilde f$ of $f$ to $\mathbb{R} \times \mathbb{R}$, and sketch it. 

6. Consider the maps $g, h : S^1 \to S^1$ given by $g(z) = z^n$ and $h(z) = 1/z^n$. (Here 
we represent $S^1$ as the set of complex numbers $z$ of absolute value 1.) Compute 
the induced homomorphisms $g_*$, $h_*$ of the infinite cyclic group $\pi_1(S^1, b_0)$ into 
itself. [Hint: Recall the equation $(\cos\theta + i\sin\theta)^n = \cos n\theta + i\sin n\theta$.] 

7. Generalize the proof of Theorem 54.5 to show that the fundamental group of the 
torus is isomorphic to the group Z x Z. 

8. Let $p : E \to B$ be a covering map, with $E$ path connected. Show that if $B$ is 
simply connected, then $p$ is a homeomorphism. 


§55 Retractions and Fixed Points 


We now prove several classical results of topology that follow from our knowledge of 
the fundamental group of $S^1$. 


Definition. If $A \subset X$, a retraction of $X$ onto $A$ is a continuous map $r : X \to A$ such 
that $r|A$ is the identity map of $A$. If such a map $r$ exists, we say that $A$ is a retract 
of $X$. 


Lemma 55.1. If $A$ is a retract of $X$, then the homomorphism of fundamental groups 
induced by inclusion $j : A \to X$ is injective. 

Proof. If $r : X \to A$ is a retraction, then the composite map $r \circ j$ equals the identity 
map of $A$. It follows that $r_* \circ j_*$ is the identity map of $\pi_1(A, a)$, so that $j_*$ must be 
injective. ∎


Theorem 55.2 (No-retraction theorem). There is no retraction of $B^2$ onto $S^1$. 


Proof. If $S^1$ were a retract of $B^2$, then the homomorphism induced by inclusion 
$j : S^1 \to B^2$ would be injective. But the fundamental group of $S^1$ is nontrivial and 
the fundamental group of $B^2$ is trivial. ∎


Lemma 55.3. Let $h : S^1 \to X$ be a continuous map. Then the following conditions 
are equivalent: 

(1) $h$ is nulhomotopic. 

(2) $h$ extends to a continuous map $k : B^2 \to X$. 

(3) $h_*$ is the trivial homomorphism of fundamental groups. 


Proof. (1) ⇒ (2). Let $H : S^1 \times I \to X$ be a homotopy between $h$ and a constant 
map. Let $\pi : S^1 \times I \to B^2$ be the map 

$$\pi(x, t) = (1 - t)x.$$

Then $\pi$ is continuous, closed and surjective, so it is a quotient map; it collapses $S^1 \times 1$ 
to the point 0 and is otherwise injective. Because $H$ is constant on $S^1 \times 1$, it induces, 
via the quotient map $\pi$, a continuous map $k : B^2 \to X$ that is an extension of $h$. See 
Figure 55.1. 


Figure 55.1 


(2) ⇒ (3). If $j : S^1 \to B^2$ is the inclusion map, then $h$ equals the composite $k \circ j$. 
Hence $h_* = k_* \circ j_*$. But 

$$j_* : \pi_1(S^1, b_0) \to \pi_1(B^2, b_0)$$

is trivial because the fundamental group of $B^2$ is trivial. Therefore $h_*$ is trivial. 

(3) ⇒ (1). Let $p : \mathbb{R} \to S^1$ be the standard covering map, and let $p_0 : I \to S^1$ be 
its restriction to the unit interval. Then $[p_0]$ generates $\pi_1(S^1, b_0)$ because $p_0$ is a loop 
in $S^1$ whose lift to $\mathbb{R}$ begins at 0 and ends at 1. 

Let $x_0 = h(b_0)$. Because $h_*$ is trivial, the loop $f = h \circ p_0$ represents the identity 
element of $\pi_1(X, x_0)$. Therefore, there is a path homotopy $F$ in $X$ between $f$ and the 
constant path at $x_0$. The map $p_0 \times \mathrm{id} : I \times I \to S^1 \times I$ is a quotient map, being 
continuous, closed, and surjective; it maps $0 \times t$ and $1 \times t$ to $b_0 \times t$ for each $t$, but 
is otherwise injective. The path homotopy $F$ maps $0 \times I$ and $1 \times I$ and $I \times 1$ to the 
point $x_0$ of $X$, so it induces a continuous map $H : S^1 \times I \to X$ that is a homotopy 
between $h$ and a constant map. See Figure 55.2. ∎


Figure 55.2 


Corollary 55.4. The inclusion map $j : S^1 \to \mathbb{R}^2 - 0$ is not nulhomotopic. The 
identity map $i : S^1 \to S^1$ is not nulhomotopic. 

Proof. There is a retraction of $\mathbb{R}^2 - 0$ onto $S^1$ given by the equation $r(x) = x/\|x\|$. 
Therefore, $j_*$ is injective, and hence nontrivial. Similarly, $i_*$ is the identity 
homomorphism, and hence nontrivial. ∎


Theorem 55.5. Given a nonvanishing vector field on $B^2$, there exists a point of $S^1$ 
where the vector field points directly inward and a point of $S^1$ where it points directly 
outward. 


Proof. A vector field on $B^2$ is an ordered pair $(x, v(x))$, where $x$ is in $B^2$ and $v$ is a 
continuous map of $B^2$ into $\mathbb{R}^2$. In calculus, one often uses the notation 

$$v(x) = v_1(x)\mathbf{i} + v_2(x)\mathbf{j}$$

for the function $v$, where $\mathbf{i}$ and $\mathbf{j}$ are the standard unit basis vectors in $\mathbb{R}^2$. But we shall 
stick with simple functional notation. To say that a vector field is nonvanishing means 
that $v(x) \ne 0$ for every $x$; in such a case $v$ actually maps $B^2$ into $\mathbb{R}^2 - 0$. 

We suppose first that $v(x)$ does not point directly inward at any point $x$ of $S^1$ and 
derive a contradiction. Consider the map $v : B^2 \to \mathbb{R}^2 - 0$; let $w$ be its restriction to 
$S^1$. Because the map $w$ extends to a map of $B^2$ into $\mathbb{R}^2 - 0$, it is nulhomotopic. 

On the other hand, $w$ is homotopic to the inclusion map $j : S^1 \to \mathbb{R}^2 - 0$. 
Figure 55.3 illustrates the homotopy; one defines it formally by the equation 

$$F(x, t) = tx + (1 - t)w(x),$$

for $x \in S^1$. We must show that $F(x, t) \ne 0$. Clearly, $F(x, t) \ne 0$ for $t = 0$ and $t = 1$. 
If $F(x, t) = 0$ for some $t$ with $0 < t < 1$, then $tx + (1 - t)w(x) = 0$, so that $w(x)$ 
equals a negative scalar multiple of $x$. But this means that $w(x)$ points directly inward 
at $x$! Hence $F$ maps $S^1 \times I$ into $\mathbb{R}^2 - 0$, as desired. 

It follows that $j$ is nulhomotopic, contradicting the preceding corollary. 

To show that $v$ points directly outward at some point of $S^1$, we apply the result 
just proved to the vector field $(x, -v(x))$. ∎


Figure 55.3 


We have already seen that every continuous map $f : [0, 1] \to [0, 1]$ has a fixed 
point (see Exercise 3 of §24). The same is true for the ball $B^2$, although the proof is 
deeper: 


Theorem 55.6 (Brouwer fixed-point theorem for the disc). If $f : B^2 \to B^2$ is 
continuous, then there exists a point $x \in B^2$ such that $f(x) = x$. 


Proof. We proceed by contradiction. Suppose that $f(x) \ne x$ for every $x$ in $B^2$. Then 
defining $v(x) = f(x) - x$ gives us a nonvanishing vector field $(x, v(x))$ on $B^2$. But 
the vector field $v$ cannot point directly outward at any point $x$ of $S^1$, for that would 
mean 

$$f(x) - x = ax$$

for some positive real number $a$, so that $f(x) = (1 + a)x$ would lie outside the unit 
ball $B^2$. We thus arrive at a contradiction. ∎


One might well wonder why fixed-point theorems are of interest in mathematics. It 
turns out that many problems, such as problems concerning existence of solutions for 
systems of equations, for instance, can be formulated as fixed-point problems. Here is 
one example, a classical theorem of Frobenius. We assume some knowledge of linear 
algebra at this point. 


*Corollary 55.7. Let $A$ be a 3 by 3 matrix of positive real numbers. Then $A$ has a 
positive real eigenvalue (characteristic value). 


Proof. Let $T : \mathbb{R}^3 \to \mathbb{R}^3$ be the linear transformation whose matrix (relative to the 
standard basis for $\mathbb{R}^3$) is $A$. Let $B$ be the intersection of the 2-sphere $S^2$ with the first 




octant 

$$\{(x_1, x_2, x_3) \mid x_1 \ge 0 \text{ and } x_2 \ge 0 \text{ and } x_3 \ge 0\}$$

of $\mathbb{R}^3$. It is easy to show that $B$ is homeomorphic to the ball $B^2$, so that the fixed-point 
theorem holds for continuous maps of $B$ into itself. 

Now if $x = (x_1, x_2, x_3)$ is in $B$, then all the components of $x$ are nonnegative and 
at least one is positive. Because all entries of $A$ are positive, the vector $T(x)$ is a vector 
all of whose components are positive. As a result, the map $x \to T(x)/\|T(x)\|$ is a 
continuous map of $B$ to itself, which therefore has a fixed point $x_0$. Then 

$$T(x_0) = \|T(x_0)\| x_0,$$

so that $T$ (and therefore the matrix $A$) has the positive real eigenvalue $\|T(x_0)\|$. ∎
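
The proof above is an existence argument; it does not say how to locate the fixed point. As a numerical illustration only, the following Python sketch iterates the map $x \to T(x)/\|T(x)\|$ starting from a point of $B$. This is power iteration, a different technique from the Brouwer argument, and its convergence for matrices with positive entries is an assumption borrowed from Perron-Frobenius theory rather than something proved in this section. The function names are hypothetical.

```python
import math

def normalized_image(A, x):
    """Return (A x / ||A x||, ||A x||) for a 3 by 3 matrix A given as nested lists."""
    y = [sum(A[i][j] * x[j] for j in range(3)) for i in range(3)]
    norm = math.sqrt(sum(c * c for c in y))
    return [c / norm for c in y], norm

def positive_eigenpair(A, iterations=200):
    """Iterate x -> A x / ||A x|| from a point of the first-octant piece of S^2.

    Returns (lam, x), where x is the final iterate and lam is the norm computed
    in the last step. If the iteration has converged, A x is close to lam * x.
    """
    x = [1 / math.sqrt(3)] * 3
    lam = 0.0
    for _ in range(iterations):
        x, lam = normalized_image(A, x)
    return lam, x
```

One way to use it is to call `positive_eigenpair` on a matrix with positive entries and then compare $Ax$ with `lam` times $x$ componentwise; a small residual indicates that the iterate is close to a fixed point of the normalized map.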


Finally, we prove a theorem that implies that the triangular region 

$$T = \{(x, y) \mid x \ge 0 \text{ and } y \ge 0 \text{ and } x + y \le 1\}$$

in $\mathbb{R}^2$ has topological dimension at least 2. (See §50.) 


*Theorem 55.8. There is an $\epsilon > 0$ such that for every open covering $\mathcal{A}$ of $T$ by sets 
of diameter less than $\epsilon$, some point of $T$ belongs to at least three elements of $\mathcal{A}$. 


Proof. We use the fact that $T$ is homeomorphic to $B^2$, so that we can apply the results 
proved in this section to the space $T$. 

Choose $\epsilon > 0$ so that no set of diameter less than $\epsilon$ intersects all three edges of $T$. 
(In fact, $\epsilon = \tfrac{1}{2}$ will do.) We suppose that $\mathcal{A} = \{U_1, \ldots, U_n\}$ is an open covering of $T$ 
by sets of diameter less than $\epsilon$, such that no three elements of $\mathcal{A}$ intersect, and derive 
a contradiction. 

For each $i = 1, \ldots, n$, choose a vertex $v_i$ of $T$ as follows: If $U_i$ intersects two 
edges of $T$, let $v_i$ be the vertex common to these edges. If $U_i$ intersects only one edge 
of $T$, let $v_i$ be one of the end points of this edge. If $U_i$ intersects no edge of $T$, let $v_i$ 
be any vertex of $T$. 

Now let $\{\phi_i\}$ be a partition of unity dominated by $\{U_1, \ldots, U_n\}$. (See §36.) Define 
$k : T \to \mathbb{R}^2$ by the equation 

$$k(x) = \sum_{i=1}^{n} \phi_i(x) v_i.$$

Then $k$ is continuous. Given a point $x$ of $T$, it lies in at most two elements of $\mathcal{A}$; hence 
at most two of the numbers $\phi_i(x)$ are nonzero. Then $k(x) = v_i$ if $x$ lies in only one 
open set $U_i$, and $k(x) = tv_i + (1 - t)v_j$ for some $t$ with $0 \le t \le 1$ if $x$ lies in two open 
sets $U_i$ and $U_j$. In either case, $k(x)$ belongs to the union of the edges of $T$, which is 
Bd $T$. Thus $k$ maps $T$ into Bd $T$. 

Furthermore, $k$ maps each edge of $T$ into itself. For if $x$ belongs to the edge $vw$ 
of $T$, any open set $U_i$ containing $x$ intersects this edge, so that $v_i$ must equal either $v$ 
or $w$. The definition of $k$ then shows that $k(x)$ belongs to $vw$. 

Let $h : \mathrm{Bd}\, T \to \mathrm{Bd}\, T$ be the restriction of $k$ to Bd $T$. Since $h$ can be extended 
to the continuous map $k$, it is nulhomotopic. On the other hand, $h$ is homotopic to 
the identity map of Bd $T$ to itself; indeed, since $h$ maps each edge of $T$ into itself, the 
straight-line homotopy between $h$ and the identity map of Bd $T$ is such a homotopy. 
But the identity map $i$ of Bd $T$ is not nulhomotopic. ∎


Exercises 


1. Show that if $A$ is a retract of $B^2$, then every continuous map $f : A \to A$ has a 
fixed point. 


2. Show that if $h : S^1 \to S^1$ is nulhomotopic, then $h$ has a fixed point and $h$ maps 
some point $x$ to its antipode $-x$. 


3. Show that if $A$ is a nonsingular 3 by 3 matrix having nonnegative entries, then $A$ 
has a positive real eigenvalue. 


4. Suppose that you are given the fact that for each $n$, there is no retraction 
$r : B^{n+1} \to S^n$. (This result can be proved using more advanced techniques of 
algebraic topology.) Prove the following: 

(a) The identity map $i : S^n \to S^n$ is not nulhomotopic. 

(b) The inclusion map $j : S^n \to \mathbb{R}^{n+1} - 0$ is not nulhomotopic. 

(c) Every nonvanishing vector field on $B^{n+1}$ points directly outward at some 
point of $S^n$, and directly inward at some point of $S^n$. 

(d) Every continuous map $f : B^{n+1} \to B^{n+1}$ has a fixed point. 

(e) Every $n + 1$ by $n + 1$ matrix with positive real entries has a positive 
eigenvalue. 

(f) If $h : S^n \to S^n$ is nulhomotopic, then $h$ has a fixed point and $h$ maps some 
point $x$ to its antipode $-x$. 


*§56 The Fundamental Theorem of Algebra 


It is a basic fact about the complex numbers that every polynomial equation 

$$x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 = 0$$

of degree $n$ with real or complex coefficients has $n$ roots (if the roots are counted 
according to their multiplicities). You probably first were told this fact in high school 
algebra, although it is doubtful that it was proved for you at that time. 

The proof is, in fact, rather hard; the most difficult part is to prove that every 
polynomial equation of positive degree has at least one root. There are various ways 
of doing this. One can use only techniques of algebra; this proof is long and arduous. 
Or one can develop the theory of analytic functions of a complex variable to the point 
where it becomes a trivial corollary of Liouville’s theorem. Or one can prove it as a 
relatively easy corollary of our computation of the fundamental group of the circle; 
this we do now. 


Theorem 56.1 (The fundamental theorem of algebra). A polynomial equation 

$$x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 = 0$$

of degree $n > 0$ with real or complex coefficients has at least one (real or complex) 
root. 


Proof. Step 1. Consider the map $f : S^1 \to S^1$ given by $f(z) = z^n$, where $z$ is a 
complex number. We show that the induced homomorphism $f_*$ of fundamental groups 
is injective. 

Let $p_0 : I \to S^1$ be the standard loop in $S^1$, 

$$p_0(s) = e^{2\pi i s} = (\cos 2\pi s, \sin 2\pi s).$$

Its image under $f_*$ is the loop 

$$f(p_0(s)) = (e^{2\pi i s})^n = (\cos 2\pi n s, \sin 2\pi n s).$$

This loop lifts to the path $s \to ns$ in the covering space $\mathbb{R}$. Therefore, the loop $f \circ p_0$ 
corresponds to the integer $n$ under the standard isomorphism of $\pi_1(S^1, b_0)$ with the 
integers, whereas $p_0$ corresponds to the number 1. Thus $f_*$ is “multiplication by $n$” in 
the fundamental group of $S^1$, so that in particular, $f_*$ is injective. 

Step 2. We show that if $g : S^1 \to \mathbb{R}^2 - 0$ is the map $g(z) = z^n$, then $g$ is not 
nulhomotopic. 

The map $g$ equals the map $f$ of Step 1 followed by the inclusion map 
$j : S^1 \to \mathbb{R}^2 - 0$. Now $f_*$ is injective, and $j_*$ is injective because $S^1$ is a retract of $\mathbb{R}^2 - 0$. 
Therefore, $g_* = j_* \circ f_*$ is injective. Thus $g$ cannot be nulhomotopic. 


Step 3. Now we prove a special case of the theorem. Given a polynomial equation 

$$x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 = 0,$$

we assume that 

$$|a_{n-1}| + \cdots + |a_1| + |a_0| < 1$$

and show that the equation has a root lying in the unit ball $B^2$. 

Assume it has no such root. Then we can define a map $k : B^2 \to \mathbb{R}^2 - 0$ by the 
equation 

$$k(z) = z^n + a_{n-1}z^{n-1} + \cdots + a_1 z + a_0.$$

Let $h$ be the restriction of $k$ to $S^1$. Because $h$ extends to a map of the unit ball into 
$\mathbb{R}^2 - 0$, the map $h$ is nulhomotopic. 

On the other hand, we shall define a homotopy $F$ between $h$ and the map $g$ of 
Step 2; since $g$ is not nulhomotopic, we have a contradiction. We define 
$F : S^1 \times I \to \mathbb{R}^2 - 0$ by the equation 

$$F(z, t) = z^n + t(a_{n-1}z^{n-1} + \cdots + a_0).$$

See Figure 56.1; $F(z, t)$ never equals 0 because 

$$\begin{aligned}
|F(z, t)| &\ge |z^n| - |t(a_{n-1}z^{n-1} + \cdots + a_0)| \\
&\ge 1 - t(|a_{n-1}z^{n-1}| + \cdots + |a_0|) \\
&= 1 - t(|a_{n-1}| + \cdots + |a_0|) > 0.
\end{aligned}$$


Figure 56.1 


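Step 4. Now we prove the general case. Given a polynomial equation 

$$x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 = 0,$$

let us choose a real number $c > 0$ and substitute $x = cy$. We obtain the equation 

$$(cy)^n + a_{n-1}(cy)^{n-1} + \cdots + a_1(cy) + a_0 = 0$$

or 

$$y^n + \frac{a_{n-1}}{c}y^{n-1} + \cdots + \frac{a_1}{c^{n-1}}y + \frac{a_0}{c^n} = 0.$$

If this equation has the root $y = y_0$, then the original equation has the root $x_0 = cy_0$. 
So we need merely choose $c$ large enough that 

$$\left|\frac{a_{n-1}}{c}\right| + \left|\frac{a_{n-2}}{c^2}\right| + \cdots + \left|\frac{a_1}{c^{n-1}}\right| + \left|\frac{a_0}{c^n}\right| < 1$$

to reduce the theorem to the special case considered in Step 3. ∎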
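
One explicit choice of $c$, offered as an illustration rather than as part of the proof: if $c \ge 1$, then each term $|a_{n-k}/c^k|$ is at most $|a_{n-k}|/c$, so $c = 1 + |a_{n-1}| + \cdots + |a_0|$ is large enough. The Python sketch below checks this on sample coefficients. The function names and the coefficient ordering $[a_{n-1}, \ldots, a_1, a_0]$ are assumptions made for this example.

```python
def scaling_constant(coeffs):
    """Return c = 1 + sum |a_i| for a monic polynomial.

    `coeffs` lists the non-leading coefficients as [a_{n-1}, ..., a_1, a_0];
    real or complex entries are both accepted.
    """
    return 1.0 + sum(abs(a) for a in coeffs)

def scaled_coefficients(coeffs, c):
    """Coefficients of the equation in y after substituting x = c*y and dividing by c**n.

    The coefficient a_{n-k} becomes a_{n-k} / c**k.
    """
    return [a / c ** k for k, a in enumerate(coeffs, start=1)]

def satisfies_step3_hypothesis(coeffs):
    """Check the hypothesis of Step 3: the absolute values sum to less than 1."""
    return sum(abs(a) for a in coeffs) < 1
```

For the equation $x^3 - 7x + 6 = 0$, the coefficient list is `[0, -7, 6]`; passing the output of `scaled_coefficients` with `c = scaling_constant([0, -7, 6])` to `satisfies_step3_hypothesis` checks that the rescaled equation falls under Step 3.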
Exercises 


1. Given a polynomial equation 

$$x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 = 0$$

with real or complex coefficients. Show that if $|a_{n-1}| + \cdots + |a_1| + |a_0| < 1$, 
then all the roots of the equation lie interior to the unit ball $B^2$. [Hint: Let 
$g(x) = 1 + a_{n-1}x + \cdots + a_1 x^{n-1} + a_0 x^n$, and show that $g(x) \ne 0$ for $x \in B^2$.] 


2. Find a circle about the origin containing all the roots of the polynomial equation 
$x^7 + x^2 + 1 = 0$. 


*§57 The Borsuk-Ulam Theorem 


Here is a “brain-teaser” problem: Suppose you are given a bounded polygonal region $A$ 
in the plane $\mathbb{R}^2$. No matter what shape $A$ has, it is easy to show that there exists a 
straight line that bisects A, that is, one that cuts the area of A in half. Simply take the 
horizontal line y = c, let f(c) denote the area of that part of A that lies beneath this 
line, note that f is a continuous function of c, and use the intermediate-value theorem 
to find a value of c for which f(c) equals exactly half the area of A. 
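
The intermediate-value argument above can be sketched numerically. The following Python example is a rough illustration under stated assumptions: the region is a simple polygon given by its vertices in order, the area below the line $y = c$ is computed by clipping the polygon against that line, and a bisection search stands in for the intermediate-value theorem (it relies on $f$ being nondecreasing in $c$). All function names are hypothetical.

```python
def polygon_area(pts):
    """Area of a simple polygon by the shoelace formula; pts lists (x, y) in order."""
    n = len(pts)
    s = 0.0
    for i in range(n):
        x1, y1 = pts[i]
        x2, y2 = pts[(i + 1) % n]
        s += x1 * y2 - x2 * y1
    return abs(s) / 2.0

def clip_below(pts, c):
    """Vertices of the part of the polygon with y <= c (one clipping pass)."""
    out = []
    n = len(pts)
    for i in range(n):
        p, q = pts[i], pts[(i + 1) % n]
        p_in, q_in = p[1] <= c, q[1] <= c
        if p_in:
            out.append(p)
        if p_in != q_in:
            # The edge crosses the line y = c; add the crossing point.
            t = (c - p[1]) / (q[1] - p[1])
            out.append((p[0] + t * (q[0] - p[0]), c))
    return out

def area_below(pts, c):
    """The function f(c): area of the part of the polygon beneath the line y = c."""
    clipped = clip_below(pts, c)
    return polygon_area(clipped) if len(clipped) >= 3 else 0.0

def bisecting_height(pts, tol=1e-12):
    """Search for c with area_below(pts, c) equal to half the area, by bisection."""
    lo = min(y for _, y in pts)
    hi = max(y for _, y in pts)
    half = polygon_area(pts) / 2.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if area_below(pts, mid) < half:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0
```

For the triangle with vertices $(0, 0)$, $(2, 0)$, $(0, 2)$, the part above the line $y = c$ is a triangle of area $(2 - c)^2/2$, so the bisecting height satisfies $(2 - c)^2 = 2$, that is, $c = 2 - \sqrt{2}$; `bisecting_height` can be compared against this value.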

But now suppose instead that you are given two such regions $A_1$ and $A_2$, and you are 
asked to find a single line that bisects them both. It is not obvious even that there 
exists such a line. Try to find one for an arbitrary pair of triangular regions if you have 
doubts! 

In fact, such a line always exists. This result is a corollary of a well-known theorem 
called the Borsuk-Ulam theorem, to which we now turn. 


Definition. If $x$ is a point of $S^n$, then its antipode is the point $-x$. A map 
$h : S^n \to S^m$ is said to be antipode-preserving if $h(-x) = -h(x)$ for all $x \in S^n$. 


Theorem 57.1. If $h : S^1 \to S^1$ is continuous and antipode-preserving, then $h$ is not 
nulhomotopic. 


Proof. Let $b_0$ be the point $(1, 0)$ of $S^1$. Let $\rho : S^1 \to S^1$ be a rotation of $S^1$ that maps 
$h(b_0)$ to $b_0$. Since $\rho$ preserves antipodes, so does the composite $\rho \circ h$. Furthermore, if 
$H$ were a homotopy between $h$ and a constant map, then $\rho \circ H$ would be a homotopy 
between $\rho \circ h$ and a constant map. Therefore, it suffices to prove the theorem under 
the additional hypothesis that $h(b_0) = b_0$. 


Step 1. Let $q : S^1 \to S^1$ be the map $q(z) = z^2$, where $z$ is a complex number. Or 
in real coordinates, $q(\cos\theta, \sin\theta) = (\cos 2\theta, \sin 2\theta)$. The map $q$ is a quotient map, 
being continuous, closed, and surjective. The inverse image under $q$ of any point of $S^1$ 
consists of two antipodal points $z$ and $-z$ of $S^1$. Because $h(-z) = -h(z)$, one has the 
equation $q(h(-z)) = q(h(z))$. Therefore, because $q$ is a quotient map, the map $q \circ h$ 
induces a continuous map $k : S^1 \to S^1$ such that $k \circ q = q \circ h$. 

$$\begin{array}{ccc}
S^1 & \xrightarrow{\;h\;} & S^1 \\
\downarrow{\scriptstyle q} & & \downarrow{\scriptstyle q} \\
S^1 & \xrightarrow{\;k\;} & S^1
\end{array}$$

Note that $q(b_0) = h(b_0) = b_0$, so that $k(b_0) = b_0$ as well. Also, $h(-b_0) = -b_0$. 

Step 2. We show that the homomorphism $k_*$ of $\pi_1(S^1, b_0)$ with itself is nontrivial. 

For this purpose, we first show that $q$ is a covering map. (We gave this as an 
exercise in §53.) The proof is similar to the proof that the standard map $p : \mathbb{R} \to S^1$ is 
a covering map. If, for instance, $U$ is the subset of $S^1$ consisting of those points having 
positive second coordinate, then $q^{-1}(U)$ consists of those points of $S^1$ lying in the first 
and third quadrants of $\mathbb{R}^2$. The map $q$ carries each of these sets homeomorphically 
onto $U$. Similar arguments apply when $U$ is the intersection of $S^1$ with the open lower 
half-plane, or with the open right and left half-planes. 

Second, we note that if $\tilde f$ is any path in $S^1$ from $b_0$ to $-b_0$, then the loop $f = q \circ \tilde f$ 
represents a nontrivial element of $\pi_1(S^1, b_0)$. For $\tilde f$ is a lifting of $f$ to $S^1$ that begins 
at $b_0$ and does not end at $b_0$. 

Finally, we show $k_*$ is nontrivial. Let $\tilde f$ be a path in $S^1$ from $b_0$ to $-b_0$, and let $f$ 
be the loop $q \circ \tilde f$. Then $k_*[f]$ is not trivial, for $k_*[f] = [k \circ (q \circ \tilde f)] = [q \circ (h \circ \tilde f)]$; 
the latter is nontrivial because $h \circ \tilde f$ is a path in $S^1$ from $b_0$ to $-b_0$. 

Step 3. Finally, we show that the homomorphism $h_*$ is nontrivial, so that $h$ cannot 
be nulhomotopic. 

The homomorphism $k_*$ is injective, being a nontrivial homomorphism of an 
infinite cyclic group with itself. The homomorphism $q_*$ is also injective; indeed, $q_*$ 
corresponds to multiplication by two in the group of integers. It follows that $k_* \circ q_*$ is 
injective. Since $q_* \circ h_* = k_* \circ q_*$, the homomorphism $h_*$ must be injective as well. ∎


Figure 57.1 


Theorem 57.2. There is no continuous antipode-preserving map $g : S^2 \to S^1$. 


Proof. Suppose $g : S^2 \to S^1$ is continuous and antipode-preserving. Let us take $S^1$ to 
be the equator of $S^2$. Then the restriction of $g$ to $S^1$ is a continuous antipode-preserving 
map $h$ of $S^1$ to itself. By the preceding theorem, $h$ is not nulhomotopic. But the upper 
hemisphere $E$ of $S^2$ is homeomorphic to the ball $B^2$, and $g$ is a continuous extension 
of $h$ to $E$! See Figure 57.1. ∎


Theorem 57.3 (Borsuk-Ulam theorem for $S^2$). Given a continuous map $f : S^2 \to \mathbb{R}^2$, 
there is a point $x$ of $S^2$ such that $f(x) = f(-x)$. 


Proof. Suppose that $f(x) \ne f(-x)$ for all $x \in S^2$. Then the map 

$$g(x) = [f(x) - f(-x)] / \|f(x) - f(-x)\|$$

is a continuous map $g : S^2 \to S^1$ such that $g(-x) = -g(x)$ for all $x$, contradicting 
Theorem 57.2. ∎


Theorem 57.4 (The bisection theorem). Given two bounded polygonal regions 
in $\mathbb{R}^2$, there exists a line in $\mathbb{R}^2$ that bisects each of them. 


Proof. We take two bounded polygonal regions $A_1$ and $A_2$ in the plane $\mathbb{R}^2 \times 1$ in $\mathbb{R}^3$, 
and show there is a line $L$ in this plane that bisects each of them. 

Given a point $u$ of $S^2$, let us consider the plane $P$ in $\mathbb{R}^3$ passing through the origin 
that has $u$ as its unit normal vector. This plane divides $\mathbb{R}^3$ into two half-spaces; let 
$f_i(u)$ equal the area of that portion of $A_i$ that lies on the same side of $P$ as does the 
vector $u$. 

If $u$ is the unit vector $\mathbf{k}$, then $f_i(u) = \text{area } A_i$; and if $u = -\mathbf{k}$, then $f_i(u) = 0$. 
Otherwise, the plane $P$ intersects the plane $\mathbb{R}^2 \times 1$ in a line $L$ that splits $\mathbb{R}^2 \times 1$ into 
two half-planes, and $f_i(u)$ is the area of that part of $A_i$ that lies on one side of this line. 
See Figure 57.2. 


Figure 57.2 


Replacing $u$ by $-u$ gives us the same plane $P$, but the other half-space, so that 
$f_i(-u)$ is the area of that part of $A_i$ that lies on the other side of $P$ from $u$. It follows 
that 

$$f_i(u) + f_i(-u) = \text{area } A_i.$$

Now consider the map $F : S^2 \to \mathbb{R}^2$ given by $F(u) = (f_1(u), f_2(u))$. The 
Borsuk-Ulam theorem gives us a point $u$ of $S^2$ for which $F(u) = F(-u)$. Then 
$f_i(u) = f_i(-u)$ for $i = 1, 2$, so that $f_i(u) = \tfrac{1}{2}\,\text{area } A_i$, as desired. ∎

We have proved the bisection theorem for bounded polygonal regions in the plane. 
However, all that was needed in the proof was the existence of an additive area function 
for $A_1$ and $A_2$. Thus, the theorem holds for any two sets $A_1$ and $A_2$ that are 
“Jordan-measurable” in the sense used in analysis. 

These theorems generalize to higher dimensions, but the proofs are considerably 
more sophisticated. The generalized version of the bisection theorem states that given 
$n$ Jordan-measurable sets in $\mathbb{R}^n$, there exists a plane of dimension $n - 1$ that bisects 
them all. In the case $n = 3$, this result goes by the pleasant name of the “ham sandwich 
theorem.” If one considers a ham sandwich to consist of two pieces of bread and a slab 
of ham, then the bisection theorem says that one can divide each of them precisely in 
half with a single whack of a cleaver! 


Exercises 


1. Prove the following “theorem of meteorology”: At any given moment in time, 
there exists a pair of antipodal points on the surface of the earth at which both 
the temperature and the barometric pressure are equal. 


2. Show that if g : S? — S? is continuous and g(x) # g(—x) for all x, then g is 
surjective. (Hint: If p € S?, then $? — {p} is homeomorphic to R?.] 


3. Leth : S! — S! be continuous and antipode-preserving with h(bp) = bp. Show 
that h, carries a generator of 7, (S!, bo) to an odd power of itself. [Hint: If k is 
the map constructed in the proof of Theorem 57.1, show that k, does the same.] 


4. Suppose you are given the fact that for each n, no continuous antipode-preserving
map $h : S^n \to S^n$ is nulhomotopic. (This result can be proved using more
advanced techniques of algebraic topology.) Prove the following:

(a) There is no retraction $r : B^{n+1} \to S^n$.

(b) There is no continuous antipode-preserving map $g : S^{n+1} \to S^n$.

(c) (Borsuk-Ulam theorem) Given a continuous map $f : S^{n+1} \to \mathbb{R}^{n+1}$, there
is a point x of $S^{n+1}$ such that $f(x) = f(-x)$.

(d) If $A_1, \ldots, A_{n+1}$ are bounded measurable sets in $\mathbb{R}^{n+1}$, there exists an
n-plane in $\mathbb{R}^{n+1}$ that bisects each of them.

As we have seen, one way of obtaining information about the fundamental group of
a space X is to study the covering spaces of X. Another is one we discuss in this 
section, which involves the notion of homotopy type. It provides a method for reducing 
the problem of computing the fundamental group of a space to that of computing the 
fundamental group of some other space—preferably, one that is more familiar. 

We begin with a lemma. 

Lemma 58.1. Let $h, k : (X, x_0) \to (Y, y_0)$ be continuous maps. If h and k are
homotopic, and if the image of the base point $x_0$ of X remains fixed at $y_0$ during the
homotopy, then the homomorphisms $h_*$ and $k_*$ are equal.


Proof. The proof is immediate. By assumption, there is a homotopy $H : X \times I \to Y$
between h and k such that $H(x_0, t) = y_0$ for all t. It follows that if f is a loop in X
based at $x_0$, then the composite

$$I \times I \xrightarrow{\ f \times \mathrm{id}\ } X \times I \xrightarrow{\ H\ } Y$$

is a homotopy between $h \circ f$ and $k \circ f$; it is a path homotopy because f is a loop at $x_0$
and H maps $x_0 \times I$ to $y_0$. ∎


Using this lemma, we generalize a result about the space $\mathbb{R}^2 - 0$ proved earlier,
proving that the homomorphism induced by inclusion $j : S^1 \to \mathbb{R}^2 - 0$ is not only
injective but surjective as well. More generally, we prove the following:


Theorem 58.2. The inclusion map $j : S^n \to \mathbb{R}^{n+1} - 0$ induces an isomorphism of
fundamental groups.

Proof. Let $X = \mathbb{R}^{n+1} - 0$; let $b_0 = (1, 0, \ldots, 0)$. Let $r : X \to S^n$ be the map
$r(x) = x/\|x\|$. Then $r \circ j$ is the identity map of $S^n$, so that $r_* \circ j_*$ is the identity
homomorphism of $\pi_1(S^n, b_0)$.

Now consider the composite $j \circ r$, which maps X to itself,

$$X \xrightarrow{\ r\ } S^n \xrightarrow{\ j\ } X.$$

This map is not the identity map of X, but it is homotopic to the identity map. Indeed,
the straight-line homotopy $H : X \times I \to X$, given by

$$H(x, t) = (1 - t)x + tx/\|x\|,$$

is a homotopy between the identity map of X and the map $j \circ r$. For $H(x, t)$ is
never equal to 0, because $(1 - t) + t/\|x\|$ is a number between 1 and $1/\|x\|$. Note
that the point $b_0$ remains fixed during the homotopy, since $\|b_0\| = 1$. It follows
from the preceding lemma that the homomorphism $(j \circ r)_* = j_* \circ r_*$ is the identity
homomorphism of $\pi_1(X, b_0)$. ∎
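
The properties of the straight-line homotopy used in this proof can be spot-checked numerically. The following Python sketch is an illustration only, not a substitute for the argument; the names `norm`, `scale`, and `H`, and the sample points in $\mathbb{R}^3 - 0$, are assumptions introduced here.

```python
import math

def norm(x):
    """Euclidean norm of a point given as a tuple of floats."""
    return math.sqrt(sum(c * c for c in x))

def scale(x, t):
    """The scalar (1 - t) + t/||x|| that multiplies x in H(x, t)."""
    return (1.0 - t) + t / norm(x)

def H(x, t):
    """Straight-line homotopy H(x, t) = (1 - t)x + t x/||x|| on R^{n+1} - 0."""
    s = scale(x, t)
    return tuple(s * c for c in x)

# Hypothetical sample points of R^3 - 0 (the case n = 2), for illustration only.
samples = [(3.0, -4.0, 0.0), (0.1, 0.2, -0.2), (1.0, 0.0, 0.0)]
times = [i / 10 for i in range(11)]

for x in samples:
    low, high = sorted((1.0, 1.0 / norm(x)))
    for t in times:
        # The scalar stays between 1 and 1/||x||, so H(x, t) is never 0.
        assert low - 1e-12 <= scale(x, t) <= high + 1e-12
        assert norm(H(x, t)) > 0.0
    # H(x, 0) = x, and H(x, 1) = x/||x|| lies on the unit sphere.
    assert all(math.isclose(a, b) for a, b in zip(H(x, 0.0), x))
    assert math.isclose(norm(H(x, 1.0)), 1.0)
```

A finite sample of course proves nothing; the check only mirrors the three facts the proof relies on: the scalar factor is positive, H starts at the identity, and H ends on the sphere.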


What made the preceding proof work? Roughly speaking, it worked because we
had a natural way of deforming the identity map of $\mathbb{R}^{n+1} - 0$ to a map that collapsed
all of $\mathbb{R}^{n+1} - 0$ onto $S^n$. The deformation H gradually collapsed each radial line
emanating from the origin to the point where it intersected $S^n$; each point of $S^n$ remained
fixed during this deformation.

Figure 58.1 illustrates, in the case n = 1, how the deformation H gives rise to a
path homotopy $H(f(s), t)$ between the loop f in $\mathbb{R}^2 - 0$ and the loop $g = f/\|f\|$
in $S^1$.

Figure 58.1


These comments lead us to formulate a more general situation in which the same 
procedure applies. 


Definition. Let A be a subspace of X. We say that A is a deformation retract of X if
the identity map of X is homotopic to a map that carries all of X into A, such that each
point of A remains fixed during the homotopy. This means that there is a continuous
map $H : X \times I \to X$ such that $H(x, 0) = x$ and $H(x, 1) \in A$ for all $x \in X$, and
$H(a, t) = a$ for all $a \in A$. The homotopy H is called a deformation retraction of X
onto A. The map $r : X \to A$ defined by the equation $r(x) = H(x, 1)$ is a retraction
of X onto A, and H is a homotopy between the identity map of X and the map $j \circ r$,
where $j : A \to X$ is inclusion.


The proof of the preceding theorem generalizes immediately to prove the follow- 
ing: 


Theorem 58.3. Let A be a deformation retract of X; let $x_0 \in A$. Then the inclusion
map

$$j : (A, x_0) \to (X, x_0)$$

induces an isomorphism of fundamental groups.

EXAMPLE 1. Let B denote the z-axis in $\mathbb{R}^3$. Consider the space $\mathbb{R}^3 - B$. It has the
punctured xy-plane $(\mathbb{R}^2 - 0) \times 0$ as a deformation retract. The map H defined by the
equation

$$H(x, y, z, t) = (x, y, (1 - t)z)$$

is a deformation retraction; it gradually collapses each line parallel to the z-axis into the
point where the line intersects the xy-plane. We conclude that the space $\mathbb{R}^3 - B$ has an
infinite cyclic fundamental group.


EXAMPLE 2. Consider $\mathbb{R}^2 - p - q$, the doubly punctured plane. We assert it has
the “figure eight” space as a deformation retract. Rather than writing equations, we merely
sketch the deformation retraction; it is the three-stage deformation indicated in Figure 58.2.

Figure 58.2


EXAMPLE 3. Another deformation retract of $\mathbb{R}^2 - p - q$ is the “theta space”

$$\theta = S^1 \cup (0 \times [-1, 1]);$$

we leave it to you to sketch the maps involved. As a result, the figure eight and the theta
space have isomorphic fundamental groups, even though neither is a deformation retract of
the other.

Of course, we do not know anything about the fundamental group of the figure eight
as yet. But we shall.


The example of the figure eight and the theta space suggests the possibility that 
there might be a more general way of showing two spaces have isomorphic fundamen- 
tal groups than by showing that one is homeomorphic to a deformation retract of the 
other. We formulate such a notion now. 

Definition. Let $f : X \to Y$ and $g : Y \to X$ be continuous maps. Suppose that the
map $g \circ f : X \to X$ is homotopic to the identity map of X, and the map $f \circ g : Y \to Y$
is homotopic to the identity map of Y. Then the maps f and g are called homotopy
equivalences, and each is said to be a homotopy inverse of the other.


It is straightforward to show that if $f : X \to Y$ is a homotopy equivalence of X
with Y and $h : Y \to Z$ is a homotopy equivalence of Y with Z, then $h \circ f : X \to Z$
is a homotopy equivalence of X with Z. It follows that the relation of homotopy
equivalence is an equivalence relation. Two spaces that are homotopy equivalent are
said to have the same homotopy type.

Note that if A is a deformation retract of X, then A has the same homotopy type
as X. For let $j : A \to X$ be the inclusion mapping and let $r : X \to A$ be the retraction
mapping. Then the composite $r \circ j$ equals the identity map of A, and the composite
$j \circ r$ is by hypothesis homotopic to the identity map of X (and in fact each point of A
remains fixed during the homotopy).

We now show that two spaces having the same homotopy type have isomorphic 
fundamental groups. For this purpose, we need to study what happens when we have 
a homotopy between two continuous maps of X into Y such that the base point of X 
does not remain fixed during the homotopy. 


Lemma 58.4. Let $h, k : X \to Y$ be continuous maps; let $h(x_0) = y_0$ and $k(x_0) = y_1$.
If h and k are homotopic, there is a path $\alpha$ in Y from $y_0$ to $y_1$ such that $k_* = \hat{\alpha} \circ h_*$.
Indeed, if $H : X \times I \to Y$ is the homotopy between h and k, then $\alpha$ is the path
$\alpha(t) = H(x_0, t)$.

$$\pi_1(X, x_0) \xrightarrow{\ h_*\ } \pi_1(Y, y_0) \xrightarrow{\ \hat{\alpha}\ } \pi_1(Y, y_1), \qquad k_* = \hat{\alpha} \circ h_*$$

Proof. Let $f : I \to X$ be a loop in X based at $x_0$. We must show that

$$k_*([f]) = \hat{\alpha}(h_*([f])).$$

This equation states that $[k \circ f] = [\bar{\alpha}] * [h \circ f] * [\alpha]$, or equivalently, that

$$[\alpha] * [k \circ f] = [h \circ f] * [\alpha].$$

This is the equation we shall verify.

To begin, consider the loops $f_0$ and $f_1$ in the space $X \times I$ given by the equations

$$f_0(s) = (f(s), 0) \quad\text{and}\quad f_1(s) = (f(s), 1).$$

Consider also the path c in $X \times I$ given by the equation

$$c(t) = (x_0, t).$$

Figure 58.3


Then $H \circ f_0 = h \circ f$ and $H \circ f_1 = k \circ f$, while $H \circ c$ equals the path $\alpha$. See Figure 58.3.

Let $F : I \times I \to X \times I$ be the map $F(s, t) = (f(s), t)$. Consider the following
paths in $I \times I$, which run along the four edges of $I \times I$:

$$\beta_0(s) = (s, 0) \quad\text{and}\quad \beta_1(s) = (s, 1),$$
$$\gamma_0(t) = (0, t) \quad\text{and}\quad \gamma_1(t) = (1, t).$$

Then $F \circ \beta_0 = f_0$ and $F \circ \beta_1 = f_1$, while $F \circ \gamma_0 = F \circ \gamma_1 = c$.

The broken-line paths $\beta_0 * \gamma_1$ and $\gamma_0 * \beta_1$ are paths in $I \times I$ from (0, 0) to (1, 1);
since $I \times I$ is convex, there is a path homotopy G between them. Then $F \circ G$ is a path
homotopy in $X \times I$ between $f_0 * c$ and $c * f_1$. And $H \circ (F \circ G)$ is a path homotopy
in Y between

$$(H \circ f_0) * (H \circ c) = (h \circ f) * \alpha \quad\text{and}$$
$$(H \circ c) * (H \circ f_1) = \alpha * (k \circ f),$$

as desired. ∎


Corollary 58.5. Let $h, k : X \to Y$ be homotopic continuous maps; let $h(x_0) = y_0$
and $k(x_0) = y_1$. If $h_*$ is injective, or surjective, or trivial, so is $k_*$.


Corollary 58.6. Let $h : X \to Y$. If h is nulhomotopic, then $h_*$ is the trivial
homomorphism.


Proof. The constant map induces the trivial homomorphism. ∎


Theorem 58.7. Let $f : X \to Y$ be continuous; let $f(x_0) = y_0$. If f is a homotopy
equivalence, then

$$f_* : \pi_1(X, x_0) \to \pi_1(Y, y_0)$$

is an isomorphism.

Proof. Let $g : Y \to X$ be a homotopy inverse for f. Consider the maps

$$(X, x_0) \xrightarrow{\ f\ } (Y, y_0) \xrightarrow{\ g\ } (X, x_1) \xrightarrow{\ f\ } (Y, y_1),$$

where $x_1 = g(y_0)$ and $y_1 = f(x_1)$. We have the corresponding induced homomorphisms:

$$\pi_1(X, x_0) \xrightarrow{\ (f_{x_0})_*\ } \pi_1(Y, y_0) \xrightarrow{\ g_*\ } \pi_1(X, x_1) \xrightarrow{\ (f_{x_1})_*\ } \pi_1(Y, y_1).$$

[Here we have to distinguish between the homomorphisms induced by f relative to
two different base points.] Now

$$g \circ f : (X, x_0) \to (X, x_1)$$

is by hypothesis homotopic to the identity map, so there is a path $\alpha$ in X such that

$$(g \circ f)_* = \hat{\alpha} \circ (i_X)_* = \hat{\alpha}.$$

It follows that $(g \circ f)_* = g_* \circ (f_{x_0})_*$ is an isomorphism.

Similarly, because $f \circ g$ is homotopic to the identity map $i_Y$, the homomorphism
$(f \circ g)_* = (f_{x_1})_* \circ g_*$ is an isomorphism.

The first fact implies that $g_*$ is surjective, and the second implies that $g_*$ is injective.
Therefore, $g_*$ is an isomorphism. Applying the first equation once again, we
conclude that

$$(f_{x_0})_* = (g_*)^{-1} \circ \hat{\alpha},$$

so that $(f_{x_0})_*$ is also an isomorphism.

Note that although g is a homotopy inverse for f, the homomorphism $g_*$ is not an
inverse for the homomorphism $(f_{x_0})_*$. ∎


The relation of homotopy equivalence is clearly more general than the notion of 
deformation retraction. The theta space and the figure eight are both deformation 
retracts of the doubly punctured plane. Therefore, they are homotopy equivalent to the 
doubly punctured plane, and hence to each other. But neither is homeomorphic to a 
deformation retract of the other; in fact, neither of them can even be imbedded in the 
other. 

It is a striking fact that the situation that occurs for these two spaces is the standard 
situation regarding homotopy equivalences. Martin Fuchs has proved a theorem to the 
effect that two spaces X and Y have the same homotopy type if and only if they are 
homeomorphic to deformation retracts of a single space Z. The proof, although it uses 
only elementary tools, is difficult [F]. 

Exercises


1. Show that if A is a deformation retract of X, and B is a deformation retract of A,
then B is a deformation retract of X.


2. For each of the following spaces, the fundamental group is either trivial, infinite
cyclic, or isomorphic to the fundamental group of the figure eight. Determine for
each space which of the three alternatives holds.

(a) The “solid torus,” $B^2 \times S^1$.

(b) The torus T with a point removed.

(c) The cylinder $S^1 \times I$.

(d) The infinite cylinder $S^1 \times \mathbb{R}$.

(e) $\mathbb{R}^3$ with the nonnegative x, y, and z axes deleted.

The following subsets of $\mathbb{R}^2$:

(f) $\{x \mid \|x\| > 1\}$

(g) $\{x \mid \|x\| \geq 1\}$

(h) $\{x \mid \|x\| < 1\}$

(i) $S^1 \cup (\mathbb{R}_+ \times 0)$

(j) $S^1 \cup (\mathbb{R}_+ \times \mathbb{R})$

(k) $S^1 \cup (\mathbb{R} \times 0)$

(l) $\mathbb{R}^2 - (\mathbb{R}_+ \times 0)$


3. Show that given a collection C of spaces, the relation of homotopy equivalence
is an equivalence relation on C.


4. Let X be the figure eight and let Y be the theta space. Describe maps $f : X \to Y$
and $g : Y \to X$ that are homotopy inverse to each other.


5. Recall that a space X is said to be contractible if the identity map of X to itself
is nulhomotopic. Show that X is contractible if and only if X has the homotopy
type of a one-point space.


6. Show that a retract of a contractible space is contractible.


7. Let A be a subspace of X; let $j : A \to X$ be the inclusion map, and let $f : X \to A$
be a continuous map. Suppose there is a homotopy $H : X \times I \to X$ between
the map $j \circ f$ and the identity map of X.

(a) Show that if f is a retraction, then $j_*$ is an isomorphism.

(b) Show that if H maps $A \times I$ into A, then $j_*$ is an isomorphism.

(c) Give an example in which $j_*$ is not an isomorphism.


*8. Find a space X and a point $x_0$ of X such that inclusion $\{x_0\} \to X$ is a homotopy
equivalence, but $\{x_0\}$ is not a deformation retract of X. [Hint: Let X be the
subspace of $\mathbb{R}^2$ that is the union of the line segments $(1/n) \times I$, for $n \in \mathbb{Z}_+$, the
line segment $0 \times I$, and the line segment $I \times 0$; let $x_0$ be the point (0, 1). If $\{x_0\}$
is a deformation retract of X, show that for any neighborhood U of $x_0$, the path
component of U containing $x_0$ contains a neighborhood of $x_0$.]


9. We define the degree of a continuous map $h : S^1 \to S^1$ as follows:

Let $b_0$ be the point (1, 0) of $S^1$; choose a generator $\gamma$ for the infinite cyclic
group $\pi_1(S^1, b_0)$. If $x_0$ is any point of $S^1$, choose a path $\alpha$ in $S^1$ from $b_0$ to $x_0$,
and define $\gamma(x_0) = \hat{\alpha}(\gamma)$. Then $\gamma(x_0)$ generates $\pi_1(S^1, x_0)$. The element $\gamma(x_0)$
is independent of the choice of the path $\alpha$, since the fundamental group of $S^1$ is
abelian.

Now given $h : S^1 \to S^1$, choose $x_0 \in S^1$ and let $h(x_0) = x_1$. Consider the
homomorphism

$$h_* : \pi_1(S^1, x_0) \to \pi_1(S^1, x_1).$$

Since both groups are infinite cyclic, we have

$$(*) \qquad h_*(\gamma(x_0)) = d \cdot \gamma(x_1)$$

for some integer d, if the group is written additively. The integer d is called the
degree of h and is denoted by deg h.

The degree of h is independent of the choice of the generator $\gamma$; choosing the
other generator would merely change the sign of both sides of (*).

(a) Show that d is independent of the choice of $x_0$.

(b) Show that if $h, k : S^1 \to S^1$ are homotopic, they have the same degree.

(c) Show that $\deg(h \circ k) = (\deg h) \cdot (\deg k)$.

(d) Compute the degrees of the constant map, the identity map, the reflection
map $\rho(x_1, x_2) = (x_1, -x_2)$, and the map $h(z) = z^n$, where z is a complex
number.

*(e) Show that if $h, k : S^1 \to S^1$ have the same degree, they are homotopic.


10. Suppose that to every map $h : S^n \to S^n$ we have assigned an integer, denoted
by deg h and called the degree of h, such that:

(i) Homotopic maps have the same degree.

(ii) $\deg(h \circ k) = (\deg h) \cdot (\deg k)$.

(iii) The identity map has degree 1, any constant map has degree 0, and the
reflection map $\rho(x_1, \ldots, x_{n+1}) = (x_1, \ldots, x_n, -x_{n+1})$ has degree −1.

[One can construct such a function, using the tools of algebraic topology.
Intuitively, deg h measures how many times h wraps $S^n$ about itself; the sign tells
you whether h preserves orientation or not.] Prove the following:

(a) There is no retraction $r : B^{n+1} \to S^n$.

(b) If $h : S^n \to S^n$ has degree different from $(-1)^{n+1}$, then h has a fixed point.
[Hint: Show that if h has no fixed point, then h is homotopic to the antipodal
map $a(x) = -x$.]

(c) If $h : S^n \to S^n$ has degree different from 1, then h maps some point x to its
antipode −x.

(d) If $S^n$ has a nonvanishing tangent vector field v, then n is odd. [Hint: If v
exists, show the identity map is homotopic to the antipodal map.]

§59 The Fundamental Group of $S^n$


Now we turn to a problem mentioned at the beginning of the chapter, the problem
of showing that the sphere, torus, and double torus are surfaces that are topologically
distinct. We begin with the sphere; we show that $S^n$ is simply connected for $n \geq 2$.
The crucial result we need is stated in the following theorem.


Theorem 59.1. Suppose $X = U \cup V$, where U and V are open sets of X. Suppose that
$U \cap V$ is path connected, and that $x_0 \in U \cap V$. Let i and j be the inclusion mappings
of U and V, respectively, into X. Then the images of the induced homomorphisms

$$i_* : \pi_1(U, x_0) \to \pi_1(X, x_0) \quad\text{and}\quad j_* : \pi_1(V, x_0) \to \pi_1(X, x_0)$$

generate $\pi_1(X, x_0)$.


Proof. This theorem states that, given any loop f in X based at $x_0$, it is path homotopic
to a product of the form $(g_1 * (g_2 * (\cdots * g_n)))$, where each $g_i$ is a loop in X
based at $x_0$ that lies either in U or in V.

Step 1. We show there is a subdivision $a_0 < a_1 < \cdots < a_n$ of the unit interval
such that $f(a_i) \in U \cap V$ and $f([a_{i-1}, a_i])$ is contained either in U or in V, for each i.

To begin, choose a subdivision $b_0, \ldots, b_m$ of [0, 1] such that for each i, the set
$f([b_{i-1}, b_i])$ is contained in either U or V. (Use the Lebesgue number lemma.) If
$f(b_i)$ belongs to $U \cap V$ for each i, we are finished. If not, let i be an index such that
$f(b_i) \notin U \cap V$. Each of the sets $f([b_{i-1}, b_i])$ and $f([b_i, b_{i+1}])$ lies either in U or
in V. If $f(b_i) \in U$, then both of these sets must lie in U; and if $f(b_i) \in V$, both of
them must lie in V. In either case, we may delete $b_i$, obtaining a new subdivision
$c_0, \ldots, c_{m-1}$ that still satisfies the condition that $f([c_{i-1}, c_i])$ is contained either in U or
in V, for each i.

A finite number of repetitions of this process leads to the desired subdivision.

Step 2. We prove the theorem. Given f, let $a_0, \ldots, a_n$ be the subdivision constructed
in Step 1. Define $f_i$ to be the path in X that equals the positive linear map of
[0, 1] onto $[a_{i-1}, a_i]$ followed by f. Then $f_i$ is a path that lies either in U or in V, and
by Theorem 51.3,

$$[f] = [f_1] * [f_2] * \cdots * [f_n].$$

For each i, choose a path $\alpha_i$ in $U \cap V$ from $x_0$ to $f(a_i)$. (Here we use the fact that
$U \cap V$ is path connected.) Since $f(a_0) = f(a_n) = x_0$, we can choose $\alpha_0$ and $\alpha_n$ to be
the constant path at $x_0$. See Figure 59.1.

Now we set

$$g_i = (\alpha_{i-1} * f_i) * \bar{\alpha}_i$$

for each i. Then $g_i$ is a loop in X based at $x_0$ whose image lies either in U or in V.
Direct computation shows that

$$[g_1] * [g_2] * \cdots * [g_n] = [f_1] * [f_2] * \cdots * [f_n].$$ ∎
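
The deletion procedure of Step 1 is purely combinatorial once one knows, for each subdivision point and each subinterval, which of U and V contains its image under f. The following Python sketch carries out that bookkeeping under the assumption that this membership data is supplied by the caller; the function name `prune_subdivision` and the sample data are hypothetical and chosen only for illustration.

```python
def prune_subdivision(points, point_sets, piece_sets):
    """Delete interior subdivision points b_i with f(b_i) not in U ∩ V.

    points      -- b_0 < b_1 < ... < b_m
    point_sets  -- point_sets[i] is the set of names ("U", "V") of the sets
                   containing f(b_i)
    piece_sets  -- piece_sets[i] is the nonempty set of names of the sets
                   known to contain f([b_i, b_{i+1}])
    Returns the pruned points and the sets containing each merged piece.
    """
    points = list(points)
    point_sets = list(point_sets)
    piece_sets = list(piece_sets)
    i = 1
    while i < len(points) - 1:
        if point_sets[i] == {"U", "V"}:
            i += 1          # f(b_i) lies in U ∩ V: keep this point
            continue
        # f(b_i) lies in only one of U, V, so both adjacent pieces lie in
        # that same set; merging them keeps the piece inside U or inside V.
        merged = piece_sets[i - 1] & piece_sets[i]
        assert merged, "membership data is inconsistent"
        del points[i]
        del point_sets[i]
        piece_sets[i - 1:i + 1] = [merged]
    return points, piece_sets

# Hypothetical membership data for a loop subdivided into five pieces.
both = {"U", "V"}
b = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]
at_points = [both, {"U"}, both, {"V"}, {"V"}, both]
on_pieces = [{"U"}, {"U"}, {"V"}, {"V"}, {"V"}]

a, pieces = prune_subdivision(b, at_points, on_pieces)
# a == [0.0, 0.4, 1.0] and pieces == [{"U"}, {"V"}]
```

Each pass through the loop either advances the index or removes a point, so the procedure stops after at most m − 1 deletions, which is the "finite number of repetitions" invoked in the proof.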




Figure 59.1 


The preceding theorem is a special case of a famous theorem of topology called
the Seifert-van Kampen theorem, which expresses the fundamental group of the space
$X = U \cup V$ quite generally, when $U \cap V$ is path connected, in terms of the fundamental
groups of U and V. We shall study this theorem in Chapter 11.


Corollary 59.2. Suppose $X = U \cup V$, where U and V are open sets of X; suppose
$U \cap V$ is nonempty and path connected. If U and V are simply connected, then X is
simply connected.


Theorem 59.3. If $n \geq 2$, the n-sphere $S^n$ is simply connected.


Proof. Let $p = (0, \ldots, 0, 1) \in \mathbb{R}^{n+1}$ and $q = (0, \ldots, 0, -1)$ be the “north pole”
and the “south pole” of $S^n$, respectively.

Step 1. We show that if $n \geq 1$, the punctured sphere $S^n - p$ is homeomorphic
to $\mathbb{R}^n$.

Define $f : (S^n - p) \to \mathbb{R}^n$ by the equation

$$f(x) = f(x_1, \ldots, x_{n+1}) = \frac{1}{1 - x_{n+1}}(x_1, \ldots, x_n).$$

The map f is called stereographic projection. (If one takes the straight line in $\mathbb{R}^{n+1}$
passing through the north pole p and the point x of $S^n - p$, then this line intersects the
n-plane $\mathbb{R}^n \times 0 \subset \mathbb{R}^{n+1}$ in the point $f(x) \times 0$.) One checks that f is a homeomorphism
by showing that the map $g : \mathbb{R}^n \to (S^n - p)$ given by

$$g(y) = g(y_1, \ldots, y_n) = (t(y) \cdot y_1, \ldots, t(y) \cdot y_n, 1 - t(y)),$$

where $t(y) = 2/(1 + \|y\|^2)$, is a right and left inverse for f.

Note that the reflection map $(x_1, \ldots, x_{n+1}) \mapsto (x_1, \ldots, x_n, -x_{n+1})$ defines a
homeomorphism of $S^n - p$ with $S^n - q$, so the latter is also homeomorphic to $\mathbb{R}^n$.

Step 2. We prove the theorem. Let U and V be the open sets $U = S^n - p$ and
$V = S^n - q$ of $S^n$.

Note first that for $n \geq 1$, the sphere $S^n$ is path connected. This follows from the
fact that U and V are path connected (being homeomorphic to $\mathbb{R}^n$) and have the point
$(1, 0, \ldots, 0)$ of $S^n$ in common.

Now we show that for $n \geq 2$, the sphere $S^n$ is simply connected. The spaces U
and V are simply connected, being homeomorphic to $\mathbb{R}^n$. Their intersection equals
$S^n - p - q$, which is homeomorphic under stereographic projection to $\mathbb{R}^n - 0$. The
latter space is path connected, for every point of $\mathbb{R}^n - 0$ can be joined to a point of
$S^{n-1}$ by a straight-line path, and $S^{n-1}$ is path connected if $n \geq 2$. Then the preceding
corollary applies. ∎
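
The claim in Step 1 that g is a two-sided inverse for stereographic projection can be spot-checked numerically. The sketch below is an illustration under stated assumptions, not a proof: the function names `stereographic` and `inverse_stereographic` and the sample points are introduced here, and points are represented as tuples of floats.

```python
import math

def stereographic(x):
    """f(x_1, ..., x_{n+1}) = (x_1, ..., x_n) / (1 - x_{n+1}), for x on S^n - p."""
    *head, last = x
    return tuple(c / (1.0 - last) for c in head)

def inverse_stereographic(y):
    """g(y) = (t*y_1, ..., t*y_n, 1 - t) with t = 2 / (1 + ||y||^2)."""
    t = 2.0 / (1.0 + sum(c * c for c in y))
    return tuple(t * c for c in y) + (1.0 - t,)

# Hypothetical sample points of R^2 (the case n = 2).
for y in [(0.0, 0.0), (0.5, -2.0), (3.0, 4.0)]:
    x = inverse_stereographic(y)
    # g(y) lies on the unit sphere, and f(g(y)) = y.
    assert math.isclose(sum(c * c for c in x), 1.0)
    assert all(math.isclose(a, b, abs_tol=1e-12)
               for a, b in zip(stereographic(x), y))

# A sample point of S^2 - p, checked in the other direction: g(f(x)) = x.
x0 = (0.6, 0.0, -0.8)
assert all(math.isclose(a, b, abs_tol=1e-12)
           for a, b in zip(inverse_stereographic(stereographic(x0)), x0))
```

Note that `stereographic` divides by $1 - x_{n+1}$, which vanishes exactly at the north pole p; this is why the domain of f is the punctured sphere.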


Exercises 


1. Let X be the union of two copies of $S^2$ having a single point in common. What
is the fundamental group of X? Prove that your answer is correct. [Be careful!
The union of two simply connected spaces having a point in common is not
necessarily simply connected. See [S], p. 59.]


2. Criticize the following “proof” that $S^2$ is simply connected: Let f be a loop
in $S^2$ based at $x_0$. Choose a point p of $S^2$ not lying in the image of f. Since
$S^2 - p$ is homeomorphic with $\mathbb{R}^2$, and $\mathbb{R}^2$ is simply connected, the loop f is path
homotopic to the constant loop.


3. (a) Show that $\mathbb{R}^1$ and $\mathbb{R}^n$ are not homeomorphic if n > 1.
(b) Show that $\mathbb{R}^2$ and $\mathbb{R}^n$ are not homeomorphic if n > 2.
It is, in fact, true that $\mathbb{R}^m$ and $\mathbb{R}^n$ are not homeomorphic if $n \neq m$, but the proof
requires more advanced tools of algebraic topology.


4. Assume the hypotheses of Theorem 59.1.
(a) What can you say about the fundamental group of X if $j_*$ is the trivial
homomorphism? If both $i_*$ and $j_*$ are trivial?
(b) Give an example where $i_*$ and $j_*$ are trivial but neither U nor V has a trivial
fundamental group.


§60 Fundamental Groups of Some Surfaces 


Recall that a surface is a Hausdorff space with a countable basis, each point of which 
has a neighborhood that is homeomorphic with an open subset of $\mathbb{R}^2$. Surfaces are of
interest in various parts of mathematics, including geometry, topology, and complex 
analysis. We consider here several surfaces, including the torus and double torus, and 
show by comparing their fundamental groups that they are not homeomorphic. In a 
later chapter, we shall classify up to homeomorphism all compact surfaces. 

First, we consider the torus. In an earlier exercise, we asked you to compute
its fundamental group using the theory of covering spaces. Here, we compute its 
fundamental group by using a theorem about the fundamental group of a product space. 

Recall that if A and B are groups with operation ·, then the cartesian product $A \times B$
is given a group structure by using the operation

$$(a \times b) \cdot (a' \times b') = (a \cdot a') \times (b \cdot b').$$

Recall also that if $h : C \to A$ and $k : C \to B$ are group homomorphisms, then the
map $\Phi : C \to A \times B$ defined by $\Phi(c) = h(c) \times k(c)$ is a group homomorphism.

Theorem 60.1. $\pi_1(X \times Y, x_0 \times y_0)$ is isomorphic with $\pi_1(X, x_0) \times \pi_1(Y, y_0)$.


Proof. Let $p : X \times Y \to X$ and $q : X \times Y \to Y$ be the projection mappings. If
we use the base points indicated in the statement of the theorem, we have induced
homomorphisms

$$p_* : \pi_1(X \times Y, x_0 \times y_0) \to \pi_1(X, x_0),$$
$$q_* : \pi_1(X \times Y, x_0 \times y_0) \to \pi_1(Y, y_0).$$

We define a homomorphism

$$\Phi : \pi_1(X \times Y, x_0 \times y_0) \to \pi_1(X, x_0) \times \pi_1(Y, y_0)$$

by the equation

$$\Phi([f]) = p_*([f]) \times q_*([f]) = [p \circ f] \times [q \circ f].$$

We shall show that Φ is an isomorphism.

The map Φ is surjective. Let $g : I \to X$ be a loop based at $x_0$; let $h : I \to Y$ be
a loop based at $y_0$. We wish to show that the element $[g] \times [h]$ lies in the image of Φ.
Define $f : I \to X \times Y$ by the equation

$$f(s) = g(s) \times h(s).$$

Then f is a loop in $X \times Y$ based at $x_0 \times y_0$, and

$$\Phi([f]) = [p \circ f] \times [q \circ f] = [g] \times [h],$$

as desired.

The kernel of Φ vanishes. Suppose that $f : I \to X \times Y$ is a loop in $X \times Y$ based
at $x_0 \times y_0$ and $\Phi([f]) = [p \circ f] \times [q \circ f]$ is the identity element. This means that
$p \circ f \simeq_p e_{x_0}$ and $q \circ f \simeq_p e_{y_0}$; let G and H be the respective path homotopies. Then
the map $F : I \times I \to X \times Y$ defined by

$$F(s, t) = G(s, t) \times H(s, t)$$

is a path homotopy between f and the constant loop based at $x_0 \times y_0$. ∎

Corollary 60.2. The fundamental group of the torus $T = S^1 \times S^1$ is isomorphic to
the group $\mathbb{Z} \times \mathbb{Z}$.


Now we define a surface called the projective plane and compute its fundamental 
group. 


Definition. The projective plane $P^2$ is the quotient space obtained from $S^2$ by identifying
each point x of $S^2$ with its antipodal point −x.

The projective plane may not be a space that is familiar to you; it cannot be imbedded
in $\mathbb{R}^3$ and is thus difficult to visualize. It is, however, the fundamental object of
study in projective geometry, just as the euclidean plane $\mathbb{R}^2$ is in ordinary euclidean
geometry. Topologists are primarily interested in it as an example of a surface.


Theorem 60.3. The projective plane $P^2$ is a compact surface, and the quotient map
$p : S^2 \to P^2$ is a covering map.

Proof. First we show that p is an open map. Let U be open in $S^2$. Now the antipodal
map $a : S^2 \to S^2$ given by $a(x) = -x$ is a homeomorphism of $S^2$; hence a(U) is
open in $S^2$. Since

$$p^{-1}(p(U)) = U \cup a(U),$$

this set also is open in $S^2$. Therefore, by definition, p(U) is open in $P^2$. A similar
proof shows that p is a closed map.

Now we show that p is a covering map. Given a point y of $P^2$, choose $x \in p^{-1}(y)$.
Then choose an ε-neighborhood U of x in $S^2$ for some ε < 1, using the euclidean
metric d of $\mathbb{R}^3$. Then U contains no pair {z, a(z)} of antipodal points of $S^2$, since
d(z, a(z)) = 2. As a result, the map

$$p : U \to p(U)$$

is bijective. Being continuous and open, it is a homeomorphism. Similarly,

$$p : a(U) \to p(a(U)) = p(U)$$

is a homeomorphism. The set $p^{-1}(p(U))$ is thus the union of the two disjoint open
sets U and a(U), each of which is mapped homeomorphically by p onto p(U). Then
p(U) is a neighborhood of p(x) = y that is evenly covered by p.

Since $S^2$ has a countable basis $\{U_n\}$, the space $P^2$ has a countable basis $\{p(U_n)\}$.

The fact that $P^2$ is Hausdorff follows from the fact that $S^2$ is normal and p is a
closed map. (See Exercise 6 of §31.) Alternatively, one can give a direct proof: Let $y_1$
and $y_2$ be two points of $P^2$. The set $p^{-1}(y_1) \cup p^{-1}(y_2)$ consists of four points; let 2ε
be the minimum distance between them. Let $U_1$ be the ε-neighborhood of one of the
points of $p^{-1}(y_1)$, and let $U_2$ be the ε-neighborhood of one of the points of $p^{-1}(y_2)$.
Then

$$U_1 \cup a(U_1) \quad\text{and}\quad U_2 \cup a(U_2)$$

are disjoint. It follows that $p(U_1)$ and $p(U_2)$ are disjoint neighborhoods of $y_1$ and $y_2$,
respectively, in $P^2$.

Since $S^2$ is a surface and every point of $P^2$ has a neighborhood homeomorphic
with an open subset of $S^2$, the space $P^2$ is also a surface. ∎


Corollary 60.4. $\pi_1(P^2, y)$ is a group of order 2.


Proof. The projection $p : S^2 \to P^2$ is a covering map. Since $S^2$ is simply connected,
we can apply Theorem 54.4, which tells us there is a bijective correspondence between
$\pi_1(P^2, y)$ and the set $p^{-1}(y)$. Since this set is a two-element set, $\pi_1(P^2, y)$ is a group
of order 2.

Any group of order 2 is isomorphic to $\mathbb{Z}/2$, the integers mod 2, of course. ∎


One can proceed similarly to define $P^n$, for any $n \in \mathbb{Z}_+$, as the space obtained
from $S^n$ by identifying each point x with its antipode −x; it is called projective
n-space. The proof of Theorem 60.3 goes through without change to prove that the
projection $p : S^n \to P^n$ is a covering map. Then because $S^n$ is simply connected for
$n \geq 2$, it follows that $\pi_1(P^n, y)$ is a two-element group for $n \geq 2$. We leave it to you
to figure out what happens when n = 1.

Now we study the double torus. We begin with a lemma about the figure eight. 


Lemma 60.5. The fundamental group of the figure eight is not abelian. 


Proof. Let X be the union of two circles A and B in $\mathbb{R}^2$ whose intersection consists
of the single point $x_0$. We describe a certain covering space E of X.

The space E is the subspace of the plane consisting of the x-axis and the y-axis,
along with tiny circles tangent to these axes, one circle tangent to the x-axis at each
nonzero integer point and one circle tangent to the y-axis at each nonzero integer point.

The projection map $p : E \to X$ wraps the x-axis around the circle A and wraps
the y-axis around the other circle B; in each case the integer points are mapped by p
into the base point $x_0$. Each circle tangent to an integer point on the x-axis is mapped
homeomorphically by p onto B, while each circle tangent to an integer point on the
y-axis is mapped homeomorphically onto A; in each case the point of tangency is
mapped onto the point $x_0$. We leave it to you to check mentally that the map p is
indeed a covering map.

We could write this description down in equations if we wished, but the informal
description seems to us easier to follow.

Now let $\tilde{f} : I \to E$ be the path $\tilde{f}(s) = s \times 0$, going along the x-axis from the
origin to the point $1 \times 0$. Let $\tilde{g} : I \to E$ be the path $\tilde{g}(s) = 0 \times s$, going along the
y-axis from the origin to the point $0 \times 1$. Let $f = p \circ \tilde{f}$ and $g = p \circ \tilde{g}$; then f and g are
loops in the figure eight based at $x_0$, going around the circles A and B, respectively.
See Figure 60.1.

We assert that $f * g$ and $g * f$ are not path homotopic, so that the fundamental
group of the figure eight is not abelian.

Figure 60.1


To prove this assertion, let us lift each of these to a path in E beginning at the
origin. The path $f * g$ lifts to a path that goes along the x-axis from the origin to $1 \times 0$
and then goes once around the circle tangent to the x-axis at $1 \times 0$. On the other hand,
the path $g * f$ lifts to a path in E that goes along the y-axis from the origin to $0 \times 1$,
and then goes once around the circle tangent to the y-axis at $0 \times 1$. Since the lifted
paths do not end at the same point, $f * g$ and $g * f$ cannot be path homotopic. ∎
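
The endpoint comparison in this proof can be modeled combinatorially. The Python sketch below is an illustration under stated assumptions: it tracks only the integer points of E lying on the two axes, writes `"a"` for one trip around A (the loop f) and `"b"` for one trip around B (the loop g), and assumes both loops are traversed in the direction that lifts to the positive direction along the corresponding axis. The function name `lift_endpoint` is hypothetical.

```python
def lift_endpoint(word, start=(0, 0)):
    """Endpoint of the lift to E of the loop spelled by `word`.

    The lift begins at `start`, an integer point (m, 0) or (0, n) on an axis.
    'a' lifts along the x-axis when the current point is on the x-axis;
    otherwise it goes once around the small circle tangent to the y-axis
    at the current point and returns there.  'b' behaves symmetrically.
    """
    m, n = start
    for letter in word:
        if letter == "a":
            if n == 0:
                m += 1      # move one unit along the x-axis
            # else: once around a circle over A, back to the same point
        elif letter == "b":
            if m == 0:
                n += 1      # move one unit along the y-axis
            # else: once around a circle over B, back to the same point
        else:
            raise ValueError("word must consist of the letters 'a' and 'b'")
    return (m, n)

end_fg = lift_endpoint("ab")   # lift of f * g ends at (1, 0)
end_gf = lift_endpoint("ba")   # lift of g * f ends at (0, 1)
assert end_fg != end_gf
```

Since path homotopic loops have lifts with the same endpoint when the lifts begin at the same point, the two distinct endpoints reflect the conclusion of the lemma.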


We shall prove later that the fundamental group of the figure eight is, in fact, the 
group that algebraists call the “free group on two generators.” 


Theorem 60.6. The fundamental group of the double torus is not abelian. 


Proof. The double torus $T \# T$ is the surface obtained by taking two copies of the
torus, deleting a small open disc from each of them, and pasting the remaining pieces
together along their edges. We assert that the figure eight X is a retract of $T \# T$.
This fact implies that inclusion $j : X \to T \# T$ induces a monomorphism $j_*$, so that
$\pi_1(T \# T, x_0)$ is not abelian.

One can write equations for the retraction $r : T \# T \to X$, but it is simpler to
indicate it in pictures, as we have done in Figure 60.2. Let Y be the union of two tori
having a point in common. First one maps $T \# T$ onto Y by a map that collapses the
dotted circle to a point but is otherwise one-to-one; it defines a homeomorphism h of
the figure eight in $T \# T$ with the figure eight in Y. Then one retracts Y onto its figure
eight by mapping each cross-sectional circle to the point where it intersects the figure
eight. Then one maps the figure eight in Y back onto the figure eight in $T \# T$ by the
map $h^{-1}$. ∎

Figure 60.2


Corollary 60.7. The 2-sphere, torus, projective plane, and double torus are topologically
distinct.


Exercises


1. Compute the fundamental groups of the “solid torus” $S^1 \times B^2$ and the product
space $S^1 \times S^2$.

2. Let X be the quotient space obtained from $B^2$ by identifying each point x of $S^1$
with its antipode −x. Show that X is homeomorphic to the projective plane $P^2$.

3. Let $p : E \to X$ be the map constructed in the proof of Lemma 60.5. Let $E'$ be
the subspace of E that is the union of the x-axis and the y-axis. Show that $p|E'$
is not a covering map.

4. The space $P^1$ and the covering map $p : S^1 \to P^1$ are familiar ones. What are
they?

5. Consider the covering map indicated in Figure 60.3. Here, p wraps $A_1$ around A
twice and wraps $B_1$ around B twice; p maps $A_0$ and $B_0$ homeomorphically
onto A and B, respectively. Use this covering space to show that the fundamental
group of the figure eight is not abelian.


Figure 60.3


Chapter 10 


Separation Theorems in the Plane 


There are several difficult questions concerning the topology of the plane that arise 
quite naturally in the study of analysis. The answers to these questions seem geomet- 
rically quite obvious but turn out to be surprisingly hard to prove. They include the 
Jordan curve theorem, the Brouwer theorem on invariance of domain, and the classical
theorem that the winding number of a simple closed curve is zero or ±1. We
prove them in this chapter as consequences of our study of covering spaces and the 
fundamental group. 


§61 The Jordan Separation Theorem 


We consider first one of the classical theorems of mathematics, the Jordan curve theo- 
rem. It states a fact that is geometrically quite believable, the fact that a simple closed 
curve in the plane always separates the plane into two pieces, its “inside” and its “out- 
side.” It was originally conjectured in 1892 by Camille Jordan, and several incorrect 
proofs were published, including one by Jordan himself. Eventually, a correct proof 
was provided by Oswald Veblen, in 1905. The early proofs were complicated, but over 
the years, simpler proofs have been found. If one uses the tools of modern algebraic
topology, singular homology theory in particular, the proof is quite straightforward. 
The proof we give here is the simplest one we know that uses only results from the 
theory of covering spaces and the fundamental group. 

Our proof of the Jordan curve theorem divides into three parts. The first, which
we call the Jordan separation theorem, states that a simple closed curve in the plane 
separates it into at least two components. The second says that an arc in the plane 
does not separate the plane. And the third, the Jordan curve theorem proper, says that 
a simple closed curve C in the plane separates it into precisely two components, of 
which C is the common boundary. The first of these theorems will be treated in this 
section. 

In dealing with separation theorems, it will often be convenient to formulate them
as separation theorems for subsets of $S^2$ rather than $\mathbb{R}^2$. The separation theorems
for $\mathbb{R}^2$ will follow. The connection between the two sets of theorems is provided by
the following lemma.

Recall that if $b$ is any point of $S^2$, there is a homeomorphism $h$ of $S^2 - b$ with
$\mathbb{R}^2$; one simply takes a rotation of $S^2$ that carries $b$ to the north pole, and follows it by
stereographic projection.
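
The homeomorphism just recalled can be checked numerically. The following is a minimal sketch, assuming that $b$ is already the north pole $(0, 0, 1)$ (so the rotation step is omitted) and that the plane is identified with the plane $z = 0$; the function names are ours and are chosen only for illustration.

```python
import math

def stereographic(point):
    """Project a point (x, y, z) of the unit sphere, other than the north pole, to the plane z = 0."""
    x, y, z = point
    if math.isclose(z, 1.0):
        raise ValueError("the north pole has no image in the plane")
    return (x / (1.0 - z), y / (1.0 - z))

def inverse_stereographic(plane_point):
    """Send a point (u, v) of the plane back to the unit sphere minus the north pole."""
    u, v = plane_point
    s = u * u + v * v
    return (2.0 * u / (s + 1.0), 2.0 * v / (s + 1.0), (s - 1.0) / (s + 1.0))

if __name__ == "__main__":
    q = inverse_stereographic((0.3, -1.2))
    print(q, stereographic(q))
```

Both maps are given by continuous formulas and are inverse to each other, which is what makes $h$ a homeomorphism in this special case.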


Lemma 61.1. Let $C$ be a compact subspace of $S^2$; let $b$ be a point of $S^2 - C$; and let
$h$ be a homeomorphism of $S^2 - b$ with $\mathbb{R}^2$. Suppose $U$ is a component of $S^2 - C$. If $U$
does not contain $b$, then $h(U)$ is a bounded component of $\mathbb{R}^2 - h(C)$. If $U$ contains $b$,
then $h(U - b)$ is the unbounded component of $\mathbb{R}^2 - h(C)$.

In particular, if $S^2 - C$ has $n$ components, then $\mathbb{R}^2 - h(C)$ has $n$ components.


Proof. We show first that if $U$ is a component of $S^2 - C$, then $U - b$ is connected.
This result is trivial if $b \notin U$, so suppose that $b \in U$ and suppose the sets $A$ and $B$
form a separation of $U - b$. Choose a neighborhood $W$ of $b$ disjoint from $C$ such that
$W$ is homeomorphic to an open ball of $\mathbb{R}^2$. Since $W$ is connected, it is contained in $U$;
since $W - b$ is connected, it is contained entirely in $A$ or in $B$. Say $W - b \subset A$. Then
$b$ is not a limit point of $B$, for $W$ is a neighborhood of $b$ disjoint from $B$. It follows
that the sets $A \cup \{b\}$ and $B$ form a separation of $U$, contrary to hypothesis.

Let $\{U_\alpha\}$ be the set of components of $S^2 - C$; let $V_\alpha = h(U_\alpha - b)$. Because $S^2 - C$
is locally connected, the sets $U_\alpha$ are connected, disjoint, open subsets of $S^2$. Therefore,
the sets $V_\alpha$ are connected, disjoint, open subsets of $\mathbb{R}^2 - h(C)$, so the sets $V_\alpha$ are the
components of $\mathbb{R}^2 - h(C)$.

Now the homeomorphism $h$ of $S^2 - b$ with $\mathbb{R}^2$ can be extended to a homeomorphism
$H$ of $S^2$ with the one-point compactification $\mathbb{R}^2 \cup \{\infty\}$ of $\mathbb{R}^2$, merely by setting
$H(b) = \infty$. If $U_\beta$ is the component of $S^2 - C$ containing $b$, then $H(U_\beta)$ is a neighborhood
of $\infty$ in $\mathbb{R}^2 \cup \{\infty\}$. Therefore $V_\beta$ is unbounded; since its complement $\mathbb{R}^2 - V_\beta$
is compact, all the other components of $\mathbb{R}^2 - h(C)$ are bounded. See Figure 61.1. ∎


Lemma 61.2 (Nulhomotopy lemma). Let $a$ and $b$ be points of $S^2$. Let $A$ be a
compact space, and let

$$f : A \to S^2 - a - b$$

be a continuous map. If $a$ and $b$ lie in the same component of $S^2 - f(A)$, then $f$ is
nulhomotopic.


Figure 61.1


Proof. One can replace $S^2$ by the one-point compactification $\mathbb{R}^2 \cup \{\infty\}$ of $\mathbb{R}^2$, letting
$a$ and $b$ correspond to the points $0$ and $\infty$. Then our lemma reduces to the following:
Let $A$ be a compact space and let $g : A \to \mathbb{R}^2 - 0$ be a continuous map. If $0$ lies in
the unbounded component of $\mathbb{R}^2 - g(A)$, then $g$ is nulhomotopic.

This statement is easy to prove. Choose a ball $B$ centered at the origin, of sufficiently
large radius that it contains the set $g(A)$. Choose a point $p$ of $\mathbb{R}^2$ lying outside $B$.
Then $0$ and $p$ both lie in the unbounded component of $\mathbb{R}^2 - g(A)$.

Because $\mathbb{R}^2$ is locally path connected, so is the open set $\mathbb{R}^2 - g(A)$. Therefore, the
components and path components of $\mathbb{R}^2 - g(A)$ are the same. Hence we can choose a
path $\alpha$ in $\mathbb{R}^2 - g(A)$ from $0$ to $p$. We define a homotopy $G : A \times I \to \mathbb{R}^2 - 0$ by the
equation

$$G(x, t) = g(x) - \alpha(t);$$

it is pictured in Figure 61.2. The homotopy $G$ is a homotopy between the map $g$ and
the map $k$ defined by $k(x) = g(x) - p$. Note that $G(x, t) \ne 0$ because the path $\alpha$ does
not intersect the set $g(A)$.

Now we define a homotopy $H : A \times I \to \mathbb{R}^2 - 0$ by the equation

$$H(x, t) = t\,g(x) - p.$$

It is a homotopy between the map $k$ and a constant map. Note that $H(x, t) \ne 0$
because $t\,g(x)$ lies inside the ball $B$ and $p$ does not.
Thus we have proved that $g$ is nulhomotopic. ∎
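
The two homotopies used in this proof can be tried out on a concrete case. The sketch below is only an illustration under assumptions of our own: $A$ is the circle $S^1$, the map $g$ carries it onto the unit circle centered at $(3, 0)$, the ball $B$ has radius 5, the point $p$ is $(0, 6)$, and $\alpha$ is the straight path from $0$ to $p$. It samples both homotopies on a grid and reports how close they come to the origin.

```python
import math

P = (0.0, 6.0)  # assumed point outside the ball B of radius 5, which contains g(S^1)

def g(theta):
    """Sample map g : S^1 -> R^2 - 0; its image is the unit circle centered at (3, 0)."""
    return (3.0 + math.cos(theta), math.sin(theta))

def alpha(t):
    """Straight path from 0 to P; it stays on the line x = 0, so it misses g(S^1)."""
    return (t * P[0], t * P[1])

def G(theta, t):
    """First homotopy, G(x, t) = g(x) - alpha(t), from g to k = g - p."""
    gx, gy = g(theta)
    ax, ay = alpha(t)
    return (gx - ax, gy - ay)

def H(theta, t):
    """Second homotopy, H(x, t) = t g(x) - p, from the constant map -p (t = 0) to k (t = 1)."""
    gx, gy = g(theta)
    return (t * gx - P[0], t * gy - P[1])

def min_norm(homotopy, steps=200):
    """Smallest distance to the origin over a grid of sample points (theta, t)."""
    return min(
        math.hypot(*homotopy(2.0 * math.pi * i / steps, j / steps))
        for i in range(steps)
        for j in range(steps + 1)
    )

if __name__ == "__main__":
    print(min_norm(G), min_norm(H))
```

A grid sample proves nothing by itself, of course; the point is only that both minima stay well away from zero, as the two "Note that" remarks in the proof predict.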


Now we prove the Jordan separation theorem. In general, if $X$ is a connected space
and $A \subset X$, we say that $A$ separates $X$ if $X - A$ is not connected; if $X - A$ has $n$
components, we say that $A$ separates $X$ into $n$ components.


Figure 61.2


An arc $A$ is a space homeomorphic to the unit interval $[0, 1]$. The end points of $A$
are the two points $p$ and $q$ of $A$ such that $A - p$ and $A - q$ are connected; the other
points of $A$ are called interior points of $A$.

A simple closed curve is a space homeomorphic to the unit circle $S^1$.


Theorem 61.3 (The Jordan separation theorem). Let $C$ be a simple closed curve
in $S^2$. Then $C$ separates $S^2$.


Proof. Because $S^2 - C$ is locally path connected, its components and path components
are the same. We assume that $S^2 - C$ is path connected and derive a contradiction.

Let us write $C$ as the union of two arcs $A_1$ and $A_2$ that intersect only in their end
points $a$ and $b$. Let $X$ denote the space $S^2 - a - b$. Let $U$ be the open set $S^2 - A_1$
of $X$, and let $V$ be the open set $S^2 - A_2$. Then $X$ is the union of the sets $U$ and $V$, and

$$U \cap V = S^2 - (A_1 \cup A_2) = S^2 - C,$$

which by hypothesis is path connected. Thus the hypotheses of Theorem 59.1 are
satisfied.
Let $x_0$ be a point of $U \cap V$. We will show that the inclusions

$$i : (U, x_0) \to (X, x_0) \quad\text{and}\quad j : (V, x_0) \to (X, x_0)$$

induce trivial homomorphisms of the fundamental groups involved. It then follows
from Theorem 59.1 that the group $\pi_1(X, x_0)$ is trivial. But $X = S^2 - a - b$, which is
homeomorphic to the punctured plane $\mathbb{R}^2 - 0$, so its fundamental group is not trivial.

Let us prove that $i_*$ is the trivial homomorphism; given a loop $f : I \to U$ based
at $x_0$, we show that $i_*([f])$ is trivial. For this purpose, let $p : I \to S^1$ be the standard
loop generating $\pi_1(S^1, b_0)$. The map $f : I \to U$ induces a continuous map $h : S^1 \to U$
such that $h \circ p = f$. See Figure 61.3.

Consider the map $i \circ h : S^1 \to S^2 - a - b$. By hypothesis, the set $i(h(S^1)) = h(S^1)$
does not intersect the connected set $A_1$ containing $a$ and $b$. Therefore, $a$ and $b$ lie in
the same component of $S^2 - i(h(S^1))$. By the preceding lemma, the map $i \circ h$ is
nulhomotopic. It follows from Lemma 55.3 that $(i \circ h)_*$ is the trivial homomorphism
of fundamental groups. But

$$(i \circ h)_*([p]) = [i \circ h \circ p] = [i \circ f] = i_*([f]).$$

Therefore, $i_*([f])$ is trivial, as desired. ∎

Figure 61.3


Let us examine the preceding proof. What facts did we use about the simple
closed curve $C$? All we actually needed was the fact that $C$ could be written as the
union of the two closed connected sets $A_1$ and $A_2$, whose intersection consisted of
the two points $a$ and $b$. This remark leads to the following generalized version of the
separation theorem, which will be useful later.


Theorem 61.4 (A general separation theorem). Let $A_1$ and $A_2$ be closed connected
subsets of $S^2$ whose intersection consists of precisely two points $a$ and $b$. Then
the set $C = A_1 \cup A_2$ separates $S^2$.


Proof. We must show first that $C$ cannot equal all of $S^2$. That fact was obvious in
the earlier proof. In the present case, we can see that $C \ne S^2$ because $S^2 - a - b$ is
connected and $C - a - b$ is not. (The sets $A_i - a - b$ form a separation of $C - a - b$.)

The remainder of the proof is a copy of the proof of the preceding theorem. ∎


Exercises 


1. Give examples to show that a simple closed curve in the torus may or may not
separate the torus.
2. Let $A$ be the subset of $\mathbb{R}^2$ consisting of the union of the topologist’s sine curve
and the broken-line path from $(0, -1)$ to $(0, -2)$ to $(1, -2)$ to $(1, \sin 1)$. See
Figure 61.4. We call $A$ the closed topologist’s sine curve. Show that if $C$ is
a subspace of $S^2$ homeomorphic to the closed topologist’s sine curve, then $C$
separates $S^2$.


Figure 61.4 


*§62 Invariance of Domain†


One of the theorems of topology that is truly fundamental, because it expresses an
intrinsic property of euclidean space, is the theorem on “invariance of domain,” proved
by L. E. J. Brouwer in 1912. It states that for any open set $U$ of $\mathbb{R}^n$ and any continuous
injective mapping $f : U \to \mathbb{R}^n$, the image set $f(U)$ is open in $\mathbb{R}^n$ and the inverse
function is continuous. (The Inverse Function Theorem of analysis derives this result
under the additional hypothesis that the map $f$ is continuously differentiable with
nonsingular Jacobian matrix.) We shall prove this theorem in the case $n = 2$.


Lemma 62.1 (Homotopy extension lemma). Let $X$ be a space such that $X \times I$ is
normal. Let $A$ be a closed subspace of $X$, and let $f : A \to Y$ be a continuous map,
where $Y$ is an open subspace of $\mathbb{R}^n$. If $f$ is nulhomotopic, then $f$ may be extended to
a continuous map $g : X \to Y$ that is also nulhomotopic.


Proof. Let $F : A \times I \to Y$ be a homotopy between $f$ and a constant map. Then
$F(a, 0) = f(a)$ and $F(a, 1) = y_0$ for all $a$. Extend $F$ to the space $X \times 1$ by setting
$F(x, 1) = y_0$ for $x \in X$. Then $F$ is a continuous map of the closed subspace
$(A \times I) \cup (X \times 1)$ of $X \times I$ into $\mathbb{R}^n$; by the Tietze extension theorem, it may be
extended to a continuous map $G : X \times I \to \mathbb{R}^n$.


†In this section, we use the Tietze extension theorem (§35).

Now the map $x \to G(x, 0)$ is an extension of $f$, but it maps $X$ into $\mathbb{R}^n$ rather
than into the subspace $Y$. To obtain our desired map, we proceed as follows: Let $U$ be
the open subset $U = G^{-1}(Y)$ of $X \times I$. Then $U$ contains $(A \times I) \cup (X \times 1)$. See
Figure 62.1. Since $I$ is compact, the tube lemma implies that there is an open set $W$
of $X$ containing $A$ such that $W \times I \subset U$. Now the space $X$ is itself normal, being
homeomorphic to the closed subspace $X \times 0$ of $X \times I$. Therefore, we may choose a
continuous function $\phi : X \to [0, 1]$ such that $\phi(x) = 0$ for $x \in A$ and $\phi(x) = 1$ for
$x \in X - W$. The map $x \to x \times \phi(x)$ carries $X$ into the subspace $(W \times I) \cup (X \times 1)$
of $X \times I$, which lies in $U$. Then the continuous map $g(x) = G(x, \phi(x))$ carries $X$
into $Y$. And for $x \in A$, we have $\phi(x) = 0$, so that $g(x) = G(x, 0) = f(x)$. Thus $g$ is
the desired extension of $f$. The map $H : X \times I \to Y$ given by

$$H(x, t) = G(x, (1 - t)\phi(x) + t)$$

is a homotopy between $g$ and a constant map. ∎


Figure 62.1


The following lemma is a partial converse to the nulhomotopy lemma of the pre- 
ceding section. 


Lemma 62.2 (Borsuk lemma). Let $a$ and $b$ be points of $S^2$. Let $A$ be a compact
space, and let $f : A \to S^2 - a - b$ be a continuous injective map. If $f$ is nulhomotopic,
then $a$ and $b$ lie in the same component of $S^2 - f(A)$.


Proof. Because $A$ is compact and $S^2$ is Hausdorff, $f(A)$ is a compact subspace of $S^2$
that is homeomorphic to $A$. Because $f$ is nulhomotopic, so is the inclusion mapping
of $f(A)$ into $S^2 - a - b$. Hence it suffices to prove the lemma in the special case where
$f$ is simply an inclusion map. Furthermore, we can replace $S^2$ by $\mathbb{R}^2 \cup \{\infty\}$, letting $a$
correspond to $0$, and $b$ to $\infty$. Then our lemma reduces to the following statement:

Let $A$ be a compact subspace of $\mathbb{R}^2 - 0$. If the inclusion $j : A \to \mathbb{R}^2 - 0$ is
nulhomotopic, then $0$ lies in the unbounded component of $\mathbb{R}^2 - A$.

This we now prove. Let $C$ be the component of $\mathbb{R}^2 - A$ containing $0$; we suppose
$C$ is bounded and derive a contradiction. Let $D$ be the union of the other components
of $\mathbb{R}^2 - A$, including the unbounded component. Then $C$ and $D$ are disjoint open sets
of $\mathbb{R}^2$, and $\mathbb{R}^2 - A = C \cup D$. See Figure 62.2.

We define a continuous map $h : \mathbb{R}^2 \to \mathbb{R}^2 - 0$ that equals the identity outside $C$.

Begin with the inclusion map $j : A \to \mathbb{R}^2 - 0$. Since $j$ is by hypothesis nulhomotopic,
the preceding lemma implies that $j$ can be extended to a continuous map $k$
of $C \cup A$ into $\mathbb{R}^2 - 0$. Then $k$ equals the identity at points of $A$. Extend $k$ to a map
$h : \mathbb{R}^2 \to \mathbb{R}^2 - 0$ by setting $h(x) = x$ for $x \in D \cup A$; then $h$ is continuous by the
pasting lemma.

Now we derive a contradiction. Let $B$ be the closed ball in $\mathbb{R}^2$ of radius $M$ centered
at the origin, where $M$ is so large that $\operatorname{Int} B$ contains $C \cup A$. (Here, we use the fact
that $C$ is bounded.) If we restrict $h$ to $B$, we obtain a map $g : B \to \mathbb{R}^2 - 0$ such that
$g(x) = x$ for $x \in \operatorname{Bd} B$. If we follow $g$ by the standard retraction $x \to Mx/\|x\|$ of
$\mathbb{R}^2 - 0$ onto $\operatorname{Bd} B$, we obtain a retraction of $B$ onto $\operatorname{Bd} B$. Such a retraction does not
exist. ∎
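
The "standard retraction" used in the last step is easy to write down explicitly. Here is a small sketch, with a function name of our own choosing, that rescales a nonzero point of the plane to have norm $M$; points already on $\operatorname{Bd} B$ are left where they are, up to floating-point rounding.

```python
import math

def radial_retraction(x, M):
    """Send a nonzero point x of the plane to M x / ||x||, a point on the circle of radius M."""
    norm = math.hypot(x[0], x[1])
    if norm == 0.0:
        raise ValueError("the origin is excluded from the domain")
    return (M * x[0] / norm, M * x[1] / norm)

if __name__ == "__main__":
    print(radial_retraction((0.3, 0.4), 5.0))
```

The map is undefined at the origin, which is why the proof needs $g$ to take values in $\mathbb{R}^2 - 0$ before it can be composed with this retraction.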


Figure 62.2 


Theorem 62.3 (Invariance of domain). If $U$ is an open subset of $\mathbb{R}^2$ and
$f : U \to \mathbb{R}^2$ is continuous and injective, then $f(U)$ is open in $\mathbb{R}^2$ and the inverse
function $f^{-1} : f(U) \to U$ is continuous.


Proof. As usual, we can replace $\mathbb{R}^2$ by $S^2$. We show that if $U$ is an open subset of $\mathbb{R}^2$
and $f : U \to S^2$ is continuous and injective, then $f(U)$ is open in $S^2$ and the inverse
function is continuous.


Step 1. We show that if $B$ is any closed ball in $\mathbb{R}^2$ contained in $U$, then $f(B)$ does
not separate $S^2$.

Let $a$ and $b$ be two points of $S^2 - f(B)$. Because the identity map $i : B \to B$ is
nulhomotopic, the map $h : B \to S^2 - a - b$ obtained by restricting $f$ is nulhomotopic.
The Borsuk lemma then implies that $a$ and $b$ lie in the same component of
$S^2 - h(B) = S^2 - f(B)$.

Step 2. We show that if $B$ is any closed ball of $\mathbb{R}^2$ lying in $U$, then $f(\operatorname{Int} B)$ is
open in $S^2$.

The space $C = f(\operatorname{Bd} B)$ is a simple closed curve in $S^2$, so it separates $S^2$. Let $V$
be the component of $S^2 - C$ that contains the connected set $f(\operatorname{Int} B)$, and let $W$ be
the union of the others. Because $S^2$ is locally connected, $V$ and $W$ are open in $S^2$. We
show $V = f(\operatorname{Int} B)$, and we are through.

We suppose $a$ is a point of $V$ that is not in $f(\operatorname{Int} B)$ and derive a contradiction. Let
$b$ be a point of $W$. Since the set $D = f(B)$ does not separate $S^2$, the set $S^2 - D$ is
a connected set containing $a$ and $b$. This set is contained in $S^2 - C$ (since $D \supset C$);
it follows that $a$ and $b$ lie in the same component of $S^2 - C$, contrary to construction.
See Figure 62.3.


C= f(Bd B) 


Figure 62.3 


Step 3. We prove the theorem. Since, for any ball $B$ contained in $U$, the set
$f(\operatorname{Int} B)$ is open in $S^2$, the map $f : U \to S^2$ is an open map. It follows that $f(U)$ is
open in $S^2$ and $f^{-1}$ is continuous. ∎


Exercises 


1. Give an example to show that the conclusion of the Borsuk lemma need not hold
if $f$ is not injective.

2. Let $A$ be a compact contractible subspace of $S^2$. Show that $A$ does not
separate $S^2$.

3. Let $X$ be a space such that $X \times I$ is normal. Let $A$ be a closed subspace of $X$;
let $f : A \to Y$ be a continuous map, where $Y$ is an open subspace of $\mathbb{R}^n$. If $f$ is
homotopic to a map that is extendable to a continuous map $h : X \to Y$, then $f$
itself is extendable to a continuous map $g : X \to Y$, such that $g \simeq h$.

4. Let $C$ be a simple closed curve in $\mathbb{R}^2 - 0$; let $j : C \to \mathbb{R}^2 - 0$ be the inclusion
mapping. Show that $j_*$ is trivial if $0$ lies in the unbounded component of $\mathbb{R}^2 - C$,
and is nontrivial otherwise. (In fact, $j_*$ is an isomorphism in the latter case, as
we shall prove in §65.)

5. Theorem. Let $U$ be a simply connected open set in $\mathbb{R}^2$. If $C$ is a simple closed
curve lying in $U$, then each bounded component of $\mathbb{R}^2 - C$ also lies in $U$.
(This condition actually characterizes the simply connected open sets of $\mathbb{R}^2$.
See [RW]. The space $\mathbb{R}^2 - C$ has, of course, only one bounded component, as
we shall prove in the next section.)

6. Suppose you are given that there is no retraction of $B^n$ onto $S^{n-1}$.
(a) Show the Borsuk lemma holds for $S^n$.
(b) Show that no compact contractible subspace of $S^n$ separates $S^n$.
(c) Suppose you are given also that any subspace of $S^n$ homeomorphic to $S^{n-1}$
separates $S^n$. Prove the invariance of domain theorem in dimension $n$.


§63 The Jordan Curve Theorem 


The special case of the Seifert-van Kampen theorem that we used in proving the Jordan
separation theorem tells us something about the fundamental group of the space
$X = U \cup V$ in the case where the intersection $U \cap V$ is path connected. In the next theorem,
we examine what happens when $U \cap V$ is not path connected. This result will enable
us to complete the proof of the Jordan curve theorem.


Theorem 63.1. Let $X$ be the union of two open sets $U$ and $V$, such that $U \cap V$ can be
written as the union of two disjoint open sets $A$ and $B$. Assume that there is a path $\alpha$
in $U$ from a point $a$ of $A$ to a point $b$ of $B$, and that there is a path $\beta$ in $V$ from $b$ to $a$.
Let $f$ be the loop $f = \alpha * \beta$.
(a) The path-homotopy class $[f]$ generates an infinite cyclic subgroup of $\pi_1(X, a)$.
*(b) If $\pi_1(X, a)$ is itself infinite cyclic, it is generated by $[f]$.†
(c) Assume there is a path $\gamma$ in $U$ from $a$ to the point $a'$ of $A$, and that there is a
path $\delta$ in $V$ from $a'$ to $a$. Let $g$ be the loop $g = \gamma * \delta$. Then the subgroups of
$\pi_1(X, a)$ generated by $[f]$ and $[g]$ intersect in the identity element alone.


Proof. The proof is in many ways an imitation of the proof in §54 that the fundamen- 
tal group of the circle is infinite cyclic. As in that proof, the crucial step is to find an 
appropriate covering space E for the space X. 

Step 1. (Construction of $E$). We construct $E$ by pasting together copies of the
subspaces $U$ and $V$. Let us take countably many copies of $U$ and countably many
copies of $V$, all disjoint, say

$$U \times (2n) \quad\text{and}\quad V \times (2n + 1)$$

for all $n \in \mathbb{Z}$, where $\mathbb{Z}$ denotes the integers. Let $Y$ denote the union of these spaces;
$Y$ is a subspace of $X \times \mathbb{Z}$. Now we form a new space $E$ as a quotient space of $Y$ by
identifying the points

$$x \times (2n) \quad\text{and}\quad x \times (2n - 1) \quad\text{for } x \in A$$

and by identifying the points

$$x \times (2n) \quad\text{and}\quad x \times (2n + 1) \quad\text{for } x \in B.$$

Let $\pi : Y \to E$ be the quotient map.


†This result uses Theorem 54.6, and will be used only when we deal with winding numbers
in §65.


Now the map $\rho : Y \to X$ defined by $\rho(x \times m) = x$ induces a map $p : E \to X$;
the map $p$ is continuous because $E$ has the quotient topology. The map $p$ is also
surjective. We shall show that $p$ is a covering map. See Figure 63.1.

First let us show that the map $\pi$ is an open map. Since $Y$ is the union of the disjoint
open sets $\{U \times (2n)\}$ and $\{V \times (2n + 1)\}$, it will suffice to show that $\pi | (U \times 2n)$ and
$\pi | (V \times (2n + 1))$ are open maps. And this is easy. Take an open set in $U \times 2n$, for
example; it will be of the form $W \times 2n$, where $W$ is open in $U$. Then

$$\pi^{-1}(\pi(W \times 2n)) = (W \times 2n) \cup [(W \cap B) \times (2n + 1)] \cup [(W \cap A) \times (2n - 1)],$$

which is the union of three open sets of $Y$ and hence open in $Y$. By definition of the
quotient topology, $\pi(W \times 2n)$ is open in $E$, as desired.

Now we prove that $p$ is a covering map; we show that the open sets $U$ and $V$
are evenly covered by $p$. Consider $U$, for example. The set $p^{-1}(U)$ is the union of
the disjoint sets $\pi(U \times 2n)$ for $n \in \mathbb{Z}$. Each of these sets is open in $E$ because $\pi$ is
an open map. Let $\pi_{2n}$ denote the restriction of $\pi$ to the open set $U \times 2n$, mapping
it onto $\pi(U \times 2n)$. It is a homeomorphism because it is bijective, continuous, and
open. Then when restricted to $\pi(U \times 2n)$, the map $p$ is just the composite of the two
homeomorphisms

$$\pi(U \times 2n) \xrightarrow{\ \pi_{2n}^{-1}\ } U \times 2n \xrightarrow{\ \rho\ } U$$

and is thus a homeomorphism. Therefore, $p | \pi(U \times 2n)$ maps this set homeomorphically
onto $U$, as desired.

Step 2. Now we define a family of liftings of the loop $f = \alpha * \beta$.

For each integer $n$, let $e_n$ be the point $\pi(a \times 2n)$ of $E$. Then the points $e_n$ are
distinct, and they constitute the set $p^{-1}(a)$. We define a lifting $\tilde f_n$ of $f$ that begins
at $e_n$ and ends at $e_{n+1}$.

Since $\alpha$ and $\beta$ are paths in $U$ and $V$, respectively, we can define

$$\tilde\alpha_n(s) = \pi(\alpha(s) \times 2n),$$
$$\tilde\beta_n(s) = \pi(\beta(s) \times (2n + 1));$$

then $\tilde\alpha_n$ and $\tilde\beta_n$ are liftings of $\alpha$ and $\beta$, respectively. (The case $n = 0$ is illustrated in
Figure 63.1.) The product $\tilde\alpha_n * \tilde\beta_n$ is defined, since $\tilde\alpha_n$ ends at $\pi(b \times 2n)$ and $\tilde\beta_n$ begins at
$\pi(b \times (2n + 1))$. We set $\tilde f_n = \tilde\alpha_n * \tilde\beta_n$, and note that $\tilde f_n$ begins at $\tilde\alpha_n(0) = \pi(a \times 2n) = e_n$
and ends at $\tilde\beta_n(1) = \pi(a \times (2n + 1)) = \pi(a \times (2n + 2)) = e_{n+1}$.


Figure 63.1


Step 3. We show that $[f]$ generates an infinite cyclic subgroup of $\pi_1(X, a)$. It
suffices to show that if $m$ is a positive integer, then $[f]^m$ is not the identity element.
But this is easy. For the product

$$\tilde h = \tilde f_0 * (\tilde f_1 * (\cdots * \tilde f_{m-1}))$$

is defined and is a lifting of the $m$-fold product

$$h = f * (f * (\cdots * f)).$$

Because $\tilde h$ begins at $e_0$ and ends at $e_m$, the class $[h] = [f]^m$ cannot be trivial.


*Step 4. Now we show that if $\pi_1(X, a)$ is infinite cyclic, it is generated by $[f]$.
Consider the lifting correspondence $\phi : \pi_1(X, a) \to p^{-1}(a)$. We showed in Step 3
that for each positive integer $m$, the correspondence $\phi$ carries $[f]^m$ to the point $e_m$ of
$p^{-1}(a)$. A similar argument shows that it carries $[f]^{-m}$ to $e_{-m}$. Thus $\phi$ is surjective.
Now by Theorem 54.6, $\phi$ induces an injective map

$$\Phi : \pi_1(X, a)/H \to p^{-1}(a),$$

where $H = p_*(\pi_1(E, e_0))$; the map $\Phi$ is surjective because $\phi$ is surjective. It follows
that $H$ is the trivial group, since the quotient of an infinite cyclic group by any
nontrivial subgroup is finite. Then the lifting correspondence $\phi$ itself is bijective; since
it maps the subgroup generated by $[f]$ onto $p^{-1}(a)$, this subgroup must equal all of
$\pi_1(X, a)$.

Step 5. Now we prove (c). The picture in Figure 63.1 may mislead you into
thinking that the element $[g]$ of $\pi_1(X, a)$ considered in part (c) is in fact trivial. But
that figure is rather special. Figure 63.2 illustrates what can occur when $A$ is itself
the union of two disjoint nonempty open sets. In this case (which will be useful to us
shortly) both $[f]$ and $[g]$ generate infinite cyclic subgroups of $\pi_1(X, a)$.


Figure 63.2


Given $g = \gamma * \delta$, we define a lifting of $g$ to $E$ as follows: Since $\gamma$ is a path in $U$,
we can define

$$\tilde\gamma(s) = \pi(\gamma(s) \times 0);$$

since $\delta$ is a path in $V$, we can define

$$\tilde\delta(s) = \pi(\delta(s) \times (-1)).$$

Then $\tilde\gamma$ and $\tilde\delta$ are liftings of $\gamma$ and $\delta$. The product $\tilde g = \tilde\gamma * \tilde\delta$ is defined, since $\tilde\gamma$ ends
at $\pi(a' \times 0)$ and $\tilde\delta$ begins at $\pi(a' \times (-1))$; and it is a lifting of $g$. Note that $\tilde g$ is a loop
in $E$, for it begins and ends at $\pi(a \times 0) = \pi(a \times (-1)) = e_0$.

It follows that the subgroups generated by $[f]$ and $[g]$ have only the identity
element in common. For the $m$-fold product of $f$ with itself lifts to a path that begins
at $e_0$ and ends at $e_m$, while every product of $g$ with itself lifts to a path beginning and
ending at $e_0$. Hence $[f]^m \ne [g]^k$ for every nonzero $m$ and $k$. ∎
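
The bookkeeping behind these liftings can be modeled combinatorially. The sketch below is our own simplification, not part of the proof: a loop at $a$ is assumed to be cut into segments that lie alternately in $U$ and in $V$, and each segment is recorded together with the piece of $U \cap V$ ($A$ or $B$) in which it ends. The function follows the identifications of Step 1 to find which copy $U \times 2n$ or $V \times (2n + 1)$ the lift occupies, and returns the index $n$ of the point $e_n$ at which the lift ends.

```python
def lift_index(segments):
    """Return n such that the lift starting at e_0 ends at e_n.

    segments: list of (piece, region) pairs, with piece in {"U", "V"} the open set
    containing the segment and region in {"A", "B"} the part of U ∩ V where it ends.
    The loop is assumed to start and end at the point a of A.
    """
    # e_0 = pi(a x 0) = pi(a x (-1)): start in U x 0 or in V x (-1).
    level = 0 if segments[0][0] == "U" else -1
    for i, (piece, region) in enumerate(segments):
        if piece != ("U" if level % 2 == 0 else "V"):
            raise ValueError("segments must alternate between U and V")
        if i == len(segments) - 1:
            break
        if piece == "U":
            # x in B: x x 2n ~ x x (2n+1);  x in A: x x 2n ~ x x (2n-1).
            level += 1 if region == "B" else -1
        else:
            # x in B: x x (2n+1) ~ x x 2n;  x in A: x x (2n+1) ~ x x (2n+2).
            level += -1 if region == "B" else 1
    if segments[-1][1] != "A":
        raise ValueError("the loop must end at the point a of A")
    # a x 2n and a x (2n-1) both represent e_n.
    return level // 2 if level % 2 == 0 else (level + 1) // 2

F_LOOP = [("U", "B"), ("V", "A")]      # f = alpha * beta
G_LOOP = [("U", "A"), ("V", "A")]      # g = gamma * delta
F_INVERSE = [("V", "B"), ("U", "A")]   # reverse of beta followed by reverse of alpha

if __name__ == "__main__":
    print(lift_index(F_LOOP), lift_index(G_LOOP), lift_index(F_LOOP * 3))
```

In this model the loop $f$ moves the endpoint up one level each time it is traversed, while $g$ always returns to $e_0$, which is the content of Steps 2, 3, and 5.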


Theorem 63.2 (A nonseparation theorem). Let $D$ be an arc in $S^2$. Then $D$ does
not separate $S^2$.


Proof. We give two proofs of this theorem. The first uses the results of the preceding
section, and the second does not.

First proof. Because $D$ is contractible, the identity map $i : D \to D$ is nulhomotopic.
Hence if $a$ and $b$ are any two points of $S^2$ not in $D$, the inclusion
$j : D \to S^2 - a - b$ is nulhomotopic. The Borsuk lemma then implies that $a$ and $b$ lie in the
same component of $S^2 - D$.

Second proof. Let us write $D$ as the union of two arcs $D_1$ and $D_2$ that intersect in
a single point $d$. Let $a$ and $b$ be points not in $D$. We show that if $a$ and $b$ can be joined
by paths in $S^2 - D_1$ and in $S^2 - D_2$, then they can be joined by a path in $S^2 - D$.
Figure 63.3 illustrates the fact that this assertion is not entirely trivial.


Figure 63.3 


We suppose that $a$ and $b$ cannot be joined by a path in $S^2 - D$ and derive a
contradiction. We apply Theorem 63.1. Let $X$ be the space $S^2 - d$. Let $U$ and $V$ be the
open sets

$$U = S^2 - D_1 \quad\text{and}\quad V = S^2 - D_2.$$

Then $X = U \cup V$, and $U \cap V = S^2 - D$. By hypothesis, $a$ and $b$ are points of $S^2 - D$
that cannot be joined by a path in $S^2 - D$. Therefore, $U \cap V$ is not path connected.
Let $A$ be the path component of $U \cap V$ containing $a$; let $B$ be the union of the other
path components of $U \cap V$. Since $U \cap V$ is locally path connected (being open in $S^2$),
the path components of $U \cap V$ are open; hence $A$ and $B$ are open in $X$. We are given
that $a$ and $b$ can be joined by paths in $U = S^2 - D_1$ and $V = S^2 - D_2$. We conclude
from Theorem 63.1 that $\pi_1(X, a)$ is not trivial. But $X = S^2 - d$, so its fundamental
group is trivial.

Now we prove the theorem. Given the arc $D$ and the points $a$ and $b$ of $S^2 - D$,
we suppose that $a$ and $b$ cannot be joined by a path in $S^2 - D$ and derive a
contradiction. Choose a homeomorphism $h : [0, 1] \to D$; let $D_1 = h([0, 1/2])$ and
$D_2 = h([1/2, 1])$. The result of the preceding paragraph shows that since $a$ and $b$
cannot be joined by a path in $S^2 - D$, they cannot be joined by paths in both $S^2 - D_1$ and
$S^2 - D_2$. To be definite, suppose that $a$ and $b$ cannot be joined by a path in $S^2 - D_1$.

Now repeat the argument, breaking $D_1$ up into two arcs $E_1 = h([0, 1/4])$ and
$E_2 = h([1/4, 1/2])$. We conclude, as before, that $a$ and $b$ cannot be joined by paths in
both $S^2 - E_1$ and $S^2 - E_2$.

Continue similarly. In this way we define a sequence

$$I \supset I_1 \supset I_2 \supset \cdots$$

of closed intervals such that $I_n$ has length $(1/2)^n$ and such that for each $n$, the points $a$
and $b$ cannot be joined by a path in $S^2 - h(I_n)$. Compactness of the unit interval
guarantees there is a point $x$ in $\bigcap I_n$; since the lengths of the intervals converge to
zero, there is only one such point.

Consider the space $S^2 - h(x)$. Since this space is homeomorphic to $\mathbb{R}^2$, the points $a$
and $b$ can be joined by a path $\alpha$ in $S^2 - h(x)$. Because $\alpha(I)$ is compact, it is closed,
so some $\epsilon$-neighborhood of $h(x)$ is disjoint from $\alpha(I)$. Then because $h$ is continuous,
there is some $m$ such that $h(I_m)$ lies in this $\epsilon$-neighborhood. It follows that $\alpha$ is a path
in $S^2 - h(I_m)$ joining $a$ and $b$, contrary to hypothesis. ∎
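
The halving argument in the second proof has the shape of a bisection. The sketch below isolates just that shape. The predicate `blocks(lo, hi)` is hypothetical: it stands for the statement "$a$ and $b$ cannot be joined by a path in $S^2 - h([lo, hi])$," which is not something one can compute in general, so the usage line substitutes a toy predicate purely to exercise the code.

```python
from fractions import Fraction

def nested_intervals(blocks, steps):
    """Return the chain I ⊃ I_1 ⊃ ... ⊃ I_steps obtained by repeated halving.

    blocks(lo, hi) is a caller-supplied (hypothetical) predicate; at each stage
    the proof guarantees that it holds for at least one of the two halves.
    """
    lo, hi = Fraction(0), Fraction(1)
    chain = [(lo, hi)]
    for _ in range(steps):
        mid = (lo + hi) / 2
        if blocks(lo, mid):
            hi = mid
        elif blocks(mid, hi):
            lo = mid
        else:
            raise ValueError("neither half blocks; the hypothesis of the argument fails")
        chain.append((lo, hi))
    return chain

if __name__ == "__main__":
    # Toy predicate: an interval "blocks" exactly when it contains the point 1/3.
    toy = lambda lo, hi: lo <= Fraction(1, 3) <= hi
    print(nested_intervals(toy, 5)[-1])
```

Exact rational arithmetic is used so that the interval $I_n$ has length exactly $(1/2)^n$, matching the statement in the proof.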


Both proofs of this theorem are interesting. As we noted in §62, the first generalizes
to show that no compact contractible subspace of $S^2$ separates $S^2$. The second
generalizes in another direction. Let us examine this second proof, and ask ourselves
what properties of the sets $D_1$ and $D_2$ made it work. One readily sees that all that was
needed was the fact that $D_1$ and $D_2$ were closed subsets of $S^2$ and that $S^2 - (D_1 \cap D_2)$
was simply connected. Hence we have the following result, which we shall use later:


Theorem 63.3 (A general nonseparation theorem). Let $D_1$ and $D_2$ be closed
subsets of $S^2$ such that $S^2 - (D_1 \cap D_2)$ is simply connected. If neither $D_1$ nor $D_2$ separates
$S^2$, then $D_1 \cup D_2$ does not separate $S^2$.


Now we prove the Jordan curve theorem.


Theorem 63.4 (The Jordan curve theorem). Let $C$ be a simple closed curve in $S^2$.
Then $C$ separates $S^2$ into precisely two components $W_1$ and $W_2$. Each of the sets $W_1$
and $W_2$ has $C$ as its boundary; that is, $C = \bar W_i - W_i$ for $i = 1, 2$.


Proof. Step 1. We first prove that $S^2 - C$ has precisely two components. Write $C$ as
the union of two arcs $C_1$ and $C_2$ that intersect in a two-point set $\{p, q\}$. Let $X$ be the
space $S^2 - p - q$, and let $U$ and $V$ be the open sets

$$U = S^2 - C_1 \quad\text{and}\quad V = S^2 - C_2.$$

Then $X = U \cup V$, and $U \cap V = S^2 - C$. The space $U \cap V$ has at least two components,
by the Jordan separation theorem.

We suppose that $U \cap V$ has more than two components and derive a contradiction.
Let $A_1$ and $A_2$ be two of the components of $U \cap V$, and let $B$ be the union of the
others. Because $S^2 - C$ is locally connected, each of these sets is open. Let $a \in A_1$
and $a' \in A_2$ and $b \in B$. Because the arcs $C_1$ and $C_2$ do not separate $S^2$, there are
paths $\alpha$ and $\gamma$ in $U$ from $a$ to $b$ and from $a$ to $a'$, respectively, and there are paths $\beta$
and $\delta$ in $V$ from $b$ to $a$ and from $a'$ to $a$, respectively. Consider the loops $f = \alpha * \beta$
and $g = \gamma * \delta$. Writing $U \cap V$ as the union of the open sets $A_1 \cup A_2$ and $B$, we see that
Theorem 63.1 implies that $[f]$ is a nontrivial element of $\pi_1(X, a)$. Writing $U \cap V$ as
the union of the disjoint open sets $A_1$ and $A_2 \cup B$, we see that $[g]$ is also a nontrivial
element of $\pi_1(X, a)$. Since $\pi_1(X, a)$ is infinite cyclic, we must have $[f]^m = [g]^k$ for
some nonzero integers $m$ and $k$. This result contradicts (c) of Theorem 63.1.


Step 2. Now we show that $C$ is the common boundary of $W_1$ and $W_2$.

Because $S^2$ is locally connected, each of the components $W_1$ and $W_2$ of $S^2 - C$
is open in $S^2$. In particular, neither contains a limit point of the other, so that both the
sets $\bar W_1 - W_1$ and $\bar W_2 - W_2$ must be contained in $C$.

To prove the reverse inclusion, we show that if $x$ is a point of $C$, every neighborhood
$U$ of $x$ intersects the closed set $\bar W_1 - W_1$. It follows that $x$ is in the set $\bar W_1 - W_1$.

So let $U$ be a neighborhood of $x$. Because $C$ is homeomorphic to the circle $S^1$, we
can break $C$ up into two arcs $C_1$ and $C_2$ that intersect in only their end points, such
that $C_1$ is small enough that it lies inside $U$. See Figure 63.4.


Figure 63.4 

Let $a$ and $b$ be points of $W_1$ and $W_2$, respectively. Because $C_2$ does not separate $S^2$,
we can find a path $\alpha$ in $S^2 - C_2$ joining $a$ and $b$. The set $\alpha(I)$ must contain a point $y$ of
the set $\bar W_1 - W_1$, because otherwise $\alpha(I)$ would be a connected set lying in the union
of the disjoint open sets $W_1$ and $S^2 - \bar W_1$, and intersecting each of them. The point $y$
belongs to the closed curve $C$, since $(\bar W_1 - W_1) \subset C$. Because the path $\alpha$ does not
intersect the arc $C_2$, the point $y$ must therefore lie in the arc $C_1$, which in turn lies in
the open set $U$. Thus, $U$ intersects $\bar W_1 - W_1$ in the point $y$, as desired. ∎


Just as with the earlier theorems, we now ask ourselves what made the proof of
this theorem work. Examining Step 1 of the proof, we see that all we used were the
facts that $C_1$ and $C_2$ were closed connected sets, that $C_1 \cap C_2$ consisted of two points,
and that neither $C_1$ nor $C_2$ separated $S^2$. The first two facts implied that $C_1 \cup C_2$
separated $S^2$ into at least two components; the third implied that there were only two
components. Hence one has, with no further effort, the following result:


Theorem 63.5. Let $C_1$ and $C_2$ be closed connected subsets of $S^2$ whose intersection
consists of two points. If neither $C_1$ nor $C_2$ separates $S^2$, then $C_1 \cup C_2$ separates $S^2$
into precisely two components.


EXAMPLE 1. The second half of the Jordan curve theorem, to the effect that $C$ is the
common boundary of $W_1$ and $W_2$, may seem so obvious as hardly to require comment. But
it depends crucially on the fact that $C$ is homeomorphic to $S^1$.

For instance, consider the space indicated in Figure 63.5. It is the union of two arcs
whose intersection consists of two points, so it separates $S^2$ into two components $W_1$
and $W_2$ just as the circle does, by Theorem 63.5. But $C$ does not equal the common
boundary of $W_1$ and $W_2$ in this case.


Figure 63.5 


There is a fourth theorem that is often considered along with these three separation
theorems. It is called the Schoenflies theorem, and it states that if $C$ is a simple closed
curve in $S^2$ and $U$ and $V$ are the components of $S^2 - C$, then $\bar{U}$ and $\bar{V}$ are each
homeomorphic to the closed unit ball $B^2$. A proof may be found in [H-S].

The separation theorems can be generalized to higher dimensions as follows: 

(1) Any subspace $C$ of $S^n$ homeomorphic to $S^{n-1}$ separates $S^n$.
(2) No subspace $A$ of $S^n$ homeomorphic to $[0, 1]$ or to some ball $B^m$ separates $S^n$.
(3) Any subspace $C$ of $S^n$ homeomorphic to $S^{n-1}$ separates $S^n$ into two components,
of which $C$ is the common boundary.

These theorems can be proved quite readily once one has studied singular homology
groups in algebraic topology. (See [Mu], p. 202.) The Brouwer theorem on
invariance of domain for $\mathbb{R}^n$ follows as a corollary.

The Schoenflies theorem, however, does not generalize to higher dimensions without
some restrictions on the way the space $C$ is imbedded in $S^n$. This is shown by the
famous example of the "Alexander horned sphere," a homeomorphic image of $S^2$ in $S^3$,
one of whose complementary domains is not simply connected! (See [H-Y], p. 176.)

The separation theorems can be generalized even further than this. The definitive
theorem along these lines is the famous Alexander-Pontryagin duality theorem, a
rather deep theorem of algebraic topology, which we shall not attempt to state here.
(See [Mu].) It implies that if the closed subspace $C$ separates $S^n$ into $k$ components,
so does any subspace of $S^n$ that is homeomorphic to $C$ (or even homotopy equivalent
to $C$). The separation theorems (1)-(3) are immediate corollaries.


Exercises 


1. Let $C_1$ and $C_2$ be disjoint simple closed curves in $S^2$.
(a) Show that $S^2 - C_1 - C_2$ has precisely three components. [Hint: If $W_1$ is
the component of $S^2 - C_1$ disjoint from $C_2$, and if $W_2$ is the component of
$S^2 - C_2$ disjoint from $C_1$, show that $\bar{W}_1 \cup \bar{W}_2$ does not separate $S^2$.]
(b) Show that these three components have boundaries $C_1$ and $C_2$ and $C_1 \cup C_2$,
respectively.
2. Let $D$ be a closed connected subspace of $S^2$ that separates $S^2$ into $n$ components.
(a) If $A$ is an arc in $S^2$ whose intersection with $D$ consists of one of its end
points, show that $D \cup A$ separates $S^2$ into $n$ components.
(b) If $A$ is an arc in $S^2$ whose intersection with $D$ consists of its end points,
show that $D \cup A$ separates $S^2$ into $n + 1$ components.
(c) If $C$ is a simple closed curve in $S^2$ that intersects $D$ in a single point, show
that $D \cup C$ separates $S^2$ into $n + 1$ components.


*3. (a) Let $D$ be a subspace of $S^2$ homeomorphic to the topologist's sine curve $\bar{S}$.
(See §24.) Show that $D$ does not separate $S^2$. [Hint: Let $h : \bar{S} \to D$ be the
homeomorphism. Given $0 < c < 1$, let $\bar{S}_c$ equal the intersection of $\bar{S}$ with
the set $\{(x, y) \mid x < c\}$. Show that given $a, b \in S^2 - D$, there is, for some
value of $c$, a path in $S^2 - h(\bar{S}_c)$ from $a$ to $b$. Conclude that there is a path in
$S^2 - D$ from $a$ to $b$.]

(b) Let $C$ be a subspace of $S^2$ homeomorphic to the closed topologist's sine
curve. Show that $C$ separates $S^2$ into precisely two components, of which $C$
is the common boundary. [Hint: Let $h$ be the homeomorphism of the closed
topologist's sine curve with $C$. Let $C_0 = h(0 \times [-1, 1])$. Show first, using
the argument of Theorem 63.4, that each point of $C - C_0$ lies in the boundary
of each component of $S^2 - C$.]


§64 Imbedding Graphs in the Plane


A (finite) linear graph G is a Hausdorff space that is written as the union of finitely 
many arcs, each pair of which intersect in at most a common end point. The arcs are 
called the edges of the graph, and the end points of the arcs are called the vertices of 
the graph. 

Linear graphs are used in mathematics to model many real-life phenomena; how- 
ever, we shall look at them simply as interesting spaces that in some sense are gener- 
alizations of simple closed curves. 

Note that any graph is determined completely (up to homeomorphism) by listing 
its vertices and specifying which pairs of vertices have an edge joining them. 


EXAMPLE 1. If $G$ contains exactly $n$ vertices, and if for every pair of distinct vertices
of $G$ there is an edge of $G$ joining them, then $G$ is called the complete graph on $n$ vertices
and is denoted $G_n$. Several such graphs are pictured in Figure 64.1. Note that the first
three of these graphs are pictured as subspaces of $\mathbb{R}^2$, but the fourth is pictured instead as
a subspace of $\mathbb{R}^3$. A little experimentation will convince you that this graph cannot in fact
be imbedded in $\mathbb{R}^2$. We shall prove this result shortly.




Figure 64.1 


EXAMPLE 2. Another interesting graph arises in considering the classical puzzle: "Given
three houses, $h_1$, $h_2$, and $h_3$, and three utilities, $g$ (for gas), $w$ (for water), and $e$ (for
electricity), can you connect each utility to each house without letting any of the connecting
lines cross?" Formulated mathematically, this is just the question whether the graph
pictured in Figure 64.2, which is called the utilities graph, can be imbedded in $\mathbb{R}^2$. Again, a
little experimentation will convince you that it cannot, a fact that we shall prove shortly.


Definition. A theta space X is a Hausdorff space that is written as the union of three 
arcs A, B, and C, each pair of which intersect precisely in their end points. (The 
space X is of course homeomorphic to the Greek letter theta.) 




Figure 64.2 


Note that as it stands, a theta space X is not a linear graph, for the arcs in question 
intersect in more than a common end point. One can write it as a graph, however, by 
breaking each of the arcs A, B, and C up into two arcs with an end point in common. 


Lemma 64.1. Let $X$ be a theta space that is a subspace of $S^2$; let $A$, $B$, and $C$ be the
arcs whose union is $X$. Then $X$ separates $S^2$ into three components, whose boundaries
are $A \cup B$, $B \cup C$, and $A \cup C$, respectively. The component having $A \cup B$ as its boundary
equals one of the components of $S^2 - A \cup B$.


Proof. Let $a$ and $b$ be the end points of the arcs $A$, $B$, and $C$. Consider the simple
closed curve $A \cup B$; it separates $S^2$ into two components $U$ and $U'$, each of which is
open in $S^2$ and has boundary $A \cup B$. See Figure 64.3.




Figure 64.3 


The space $C - a - b$ is connected, so it is contained in one of these components,
say in $U'$. Then consider the two spaces $\bar{U} = U \cup A \cup B$ and $C$; each is connected.
Neither separates $S^2$, for $C$ is an arc, and the complement of $\bar{U}$ is the connected set $U'$.
Since the intersection of these two sets consists of the two points $a$ and $b$, their union
separates $S^2$ into two components $V$ and $W$, by Theorem 63.5. It follows that
$S^2 - (A \cup B \cup C)$ is the union of the three disjoint connected sets $U$, $V$, and $W$; because
they are open in $S^2$, they are the components of $S^2 - (A \cup B \cup C)$. The component
$U$ has $A \cup B$ as its boundary. Symmetry implies that the other two have $B \cup C$ and
$A \cup C$ as their boundaries. ∎


Theorem 64.2. Let X be the utilities graph. Then X cannot be imbedded in the plane. 


Proof. If $X$ can be imbedded in the plane, then it can be imbedded in $S^2$. So suppose
$X$ is a subspace of $S^2$. We derive a contradiction.

We use the notation of Example 2, where $g$, $w$, $e$, $h_1$, $h_2$, and $h_3$ are the vertices
of $X$. Let $A$, $B$, and $C$ be the following arcs contained in $X$:


$A = g h_1 w$,
$B = g h_2 w$,
$C = g h_3 w$.


Each pair of these arcs intersect in their end points $g$ and $w$ alone; hence $Y = A \cup B \cup C$
is a theta space. The space $Y$ separates $S^2$ into three components $U$, $V$, and $W$, whose
boundaries are $A \cup B$, $B \cup C$, and $A \cup C$, respectively. See Figure 64.4.

Now the vertex $e$ of $X$ lies in one of these three components, so that the arcs $e h_1$
and $e h_2$ and $e h_3$ of $X$ lie in the closure of that component. That component cannot
be $U$, for $\bar{U}$ is contained in $U \cup A \cup B$, a set that does not contain the point $h_3$.
Similarly, the component containing $e$ cannot be $V$ or $W$, because $\bar{V}$ does not contain
$h_1$, and $\bar{W}$ does not contain $h_2$. Thus, we have reached a contradiction. ∎


Figure 64.4 


Lemma 64.3. Let $X$ be a subspace of $S^2$ that is a complete graph on four vertices $a_1$,
$a_2$, $a_3$, and $a_4$. Then $X$ separates $S^2$ into four components. The boundaries of these
components are the sets $X_1$, $X_2$, $X_3$, and $X_4$, where $X_i$ is the union of those edges
of $X$ that do not have $a_i$ as a vertex.


Proof. Let $Y$ be the union of all the arcs of $X$ different from the arc $a_2 a_4$. Then we
can write $Y$ as a theta space by setting

$A = a_1 a_2 a_3$,

$B = a_1 a_3$,

$C = a_1 a_4 a_3$.
See Figure 64.5. The arcs $A$, $B$, and $C$ intersect in their end points $a_1$ and $a_3$ alone,
and their union is $Y$.


Figure 64.5 


The space $Y$ separates $S^2$ into three components $U$, $V$, and $W$, whose boundaries
are $A \cup B$, $B \cup C$, and $A \cup C$, respectively. The space $a_2 a_4 - a_2 - a_4$, being connected,
must lie in one of them. It cannot lie in $U$, because $A \cup B$ does not contain $a_4$. And it
cannot lie in $V$ because $B \cup C$ does not contain $a_2$. Hence it must lie in $W$.

Now $\bar{U} \cup \bar{V}$ is connected because $\bar{U}$ and $\bar{V}$ are connected and have nonempty
intersection $B$. Furthermore, the set $\bar{U} \cup \bar{V}$ does not separate $S^2$, because its complement
is $W$. Similarly, the arc $a_2 a_4$ is connected and does not separate $S^2$. And the sets $a_2 a_4$
and $\bar{U} \cup \bar{V}$ intersect in the points $a_2$ and $a_4$ alone. It follows from Theorem 63.5 that
$a_2 a_4 \cup \bar{U} \cup \bar{V}$ separates $S^2$ into two components $W_1$ and $W_2$. Then $S^2 - X$ is the union
of the four disjoint connected sets $U$, $V$, $W_1$, and $W_2$. Since these sets are open, they
are the components of $S^2 - X$.

Now one of these components, namely $U$, has the graph $A \cup B = X_4$ as its boundary.
Symmetry implies that the other three have $X_1$, $X_2$, and $X_3$ as their respective
boundaries. ∎


Theorem 64.4. The complete graph on five vertices cannot be imbedded in the plane. 


Proof. Suppose that $G$ is a subspace of $S^2$ that is a complete graph on the five vertices
$a_1$, $a_2$, $a_3$, $a_4$, and $a_5$. Let $X$ be the union of those edges of $G$ that do not have $a_5$ as
a vertex; then $X$ is a complete graph on four vertices. The space $X$ separates $S^2$ into
four components, whose respective boundaries are the graphs $X_1, \ldots, X_4$, where $X_i$
consists of those edges of $X$ that do not have $a_i$ as a vertex. Now the point $a_5$ must lie
in one of these four components. It follows that the connected space


$a_1 a_5 \cup a_2 a_5 \cup a_3 a_5 \cup a_4 a_5$,


which is the union of those edges of $G$ that have $a_5$ as a vertex, must lie in the closure of
this component. Then all the vertices $a_1, \ldots, a_4$ lie in the boundary of this component.
But this is impossible, for none of the graphs $X_i$ contains all four vertices $a_1, \ldots, a_4$.
Thus we reach a contradiction. ∎
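The last step of this proof is purely combinatorial: each boundary graph $X_i$ omits exactly the vertex $a_i$, so no single boundary contains all four vertices. The following is a minimal sketch that checks this by enumeration; the integer labels 1..4 and the helper names `boundary_graph` and `vertex_set` are illustrative choices, not notation from the text.

```python
from itertools import combinations

# Label the vertices a_1, ..., a_4 of the complete graph X by 1..4 (an assumption
# made for illustration; only the incidence pattern matters).
vertices = [1, 2, 3, 4]
edges = list(combinations(vertices, 2))  # the 6 edges of X


def boundary_graph(i):
    """Edges of X that do not have a_i as a vertex (the graph X_i)."""
    return [e for e in edges if i not in e]


def vertex_set(edge_list):
    """Set of vertices that appear as an end point of some edge in edge_list."""
    return {v for e in edge_list for v in e}


for i in vertices:
    missing = set(vertices) - vertex_set(boundary_graph(i))
    print(f"X_{i}: {len(boundary_graph(i))} edges, missing vertices {sorted(missing)}")
```

Each $X_i$ is the triangle on the three vertices other than $a_i$, which is why the four edges from $a_5$ cannot all end on the boundary of one component.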


It follows from these theorems that if a graph G contains a subgraph that is a 
utilities graph or a complete graph on five vertices, then G cannot be imbedded in the 
plane. It is a remarkable theorem, due to Kuratowski, that the converse is also true! 
The proof is not easy. 
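As a quick sanity check on the two theorems, one can compare edge counts with the standard bounds that follow from Euler's formula for a connected planar graph with $V \geq 3$ vertices and $E$ edges: $E \leq 3V - 6$ in general, and $E \leq 2V - 4$ when the graph has no triangles. This is a different argument from the one used in this section (Euler's formula is not proved here), and the bounds are only necessary conditions for planarity. The function names below are hypothetical.

```python
from itertools import combinations, product


def complete_graph(n):
    """Edge set of the complete graph on vertices 0..n-1."""
    return set(combinations(range(n), 2))


def utilities_graph():
    """Edge set of the utilities graph: each utility joined to each house."""
    houses = ["h1", "h2", "h3"]
    utilities = ["g", "w", "e"]
    return set(product(utilities, houses))


def violates_edge_bound(num_vertices, num_edges, triangle_free=False):
    """True if the edge count exceeds the Euler-formula bound for planar graphs.

    Assumes a connected graph with at least 3 vertices and no repeated edges.
    A False result does NOT prove planarity; the bound is only necessary.
    """
    bound = 2 * num_vertices - 4 if triangle_free else 3 * num_vertices - 6
    return num_edges > bound


print(violates_edge_bound(5, len(complete_graph(5))))                      # K5: 10 edges vs bound 9
print(violates_edge_bound(6, len(utilities_graph()), triangle_free=True))  # 9 edges vs bound 8
print(violates_edge_bound(4, len(complete_graph(4))))                      # K4: 6 edges vs bound 6
```

The utilities graph is triangle-free because every edge joins a utility to a house, so the sharper bound applies to it.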


Exercise 


1. Let $X$ be a space that is written as the union of finitely many arcs $A_1, \ldots, A_n$,
each pair of which intersect in at most a common end point.
(a) Show that $X$ is Hausdorff if and only if each arc $A_i$ is closed in $X$.
(b) Give an example to show that $X$ need not be Hausdorff. [Hint: See
Exercise 5 of §36.]


§65 The Winding Number of a Simple Closed Curve 


If $h : S^1 \to \mathbb{R}^2 - \mathbf{0}$ is a continuous map, then the induced homomorphism $h_*$ carries a
generator of the fundamental group of $S^1$ to some integral power of a generator of the
fundamental group of $\mathbb{R}^2 - \mathbf{0}$. This integral power $n$ is called the winding number of $h$
with respect to $\mathbf{0}$. It measures how many times $h$ "wraps $S^1$ around the origin;" its sign
depends of course on the choice of generators. See Figure 65.1. We will introduce it
more formally in the next section.




Figure 65.1 


For the present, we merely ask the question: What can one say about the winding
number of $h$ if $h$ is injective, that is, if $h$ is a homeomorphism of $S^1$ with a simple
closed curve $C$ in $\mathbb{R}^2 - \mathbf{0}$? The illustrations in Figure 65.2 suggest the obvious
conjecture: If $\mathbf{0}$ belongs to the unbounded component of $\mathbb{R}^2 - C$, then $n = 0$, while if $\mathbf{0}$
belongs to the bounded component, then $n = \pm 1$.

Figure 65.2


The first conjecture is easy to prove, for Lemma 61.2 tells us that $h$ is nulhomotopic
if $\mathbf{0}$ belongs to the unbounded component of $\mathbb{R}^2 - C$. On the other hand, the second
conjecture is surprisingly difficult; it is in fact a rather deep result. We prove it in this
section.

As usual, we shall replace $\mathbb{R}^2 \cup \{\infty\}$ by $S^2$, letting $p$ be the point corresponding
to $\mathbf{0}$ and $q$ be the point corresponding to $\infty$. Then our conjecture can be reformulated
as follows: If $C$ is a simple closed curve in $S^2$, and if $p$ and $q$ belong to different
components of $S^2 - C$, then the inclusion mapping $j : C \to S^2 - p - q$ induces an
isomorphism of fundamental groups. This is what we shall prove.

First, we prove our result in the case where the simple closed curve C is contained 
in a complete graph on four vertices. Then we prove the general case. 


Lemma 65.1. Let $G$ be a subspace of $S^2$ that is a complete graph on four vertices
$a_1, \ldots, a_4$. Let $C$ be the subgraph $a_1 a_2 a_3 a_4 a_1$, which is a simple closed curve. Let $p$
and $q$ be interior points of the edges $a_1 a_3$ and $a_2 a_4$, respectively. Then:
(a) The points $p$ and $q$ lie in different components of $S^2 - C$.
(b) The inclusion $j : C \to S^2 - p - q$ induces an isomorphism of fundamental
groups.


Proof. (a) As in the proof of Lemma 64.3, the theta space $C \cup a_1 a_3$ separates $S^2$ into
three components $U$, $V$, and $W$. One of these, say $W$, has $C$ as its boundary; it is the
only component whose boundary contains both $a_2$ and $a_4$. Therefore, $a_2 a_4 - a_2 - a_4$
must lie in $W$, so that in particular, $q$ belongs to $W$. Of course, $p$ is not in $W$ because $p$
belongs to the theta space $C \cup a_1 a_3$. Now Lemma 64.1 tells us that $W$ is one of the
components of $S^2 - C$; therefore, $p$ and $q$ belong to different components of $S^2 - C$.

(b) Let $X = S^2 - p - q$. The idea of the proof is the following: We choose a
point $x$ interior to the arc $a_1 a_2$, and a point $y$ interior to the arc $a_3 a_4$. And we let $\alpha$ and
$\beta$ be the broken-line paths


$\alpha = x a_1 a_4 y$ and $\beta = y a_3 a_2 x$.


Then $\alpha * \beta$ is a loop lying in the simple closed curve $C$. We shall prove that $\alpha * \beta$
represents a generator of the fundamental group of $X$. It follows that the homomorphism
$j_* : \pi_1(C, x) \to \pi_1(X, x)$ is surjective, so that $j_*$ must be an isomorphism (since the
groups involved are infinite cyclic). See Figure 65.3.


Let $D_1$ and $D_2$ be the arcs
$D_1 = p a_3 a_2 q$ and $D_2 = q a_4 a_1 p$,


and let $U = S^2 - D_1$ and $V = S^2 - D_2$. See Figure 65.4. Then $X = U \cup V$, and
$U \cap V$ equals $S^2 - D$, where $D$ is the simple closed curve $D = D_1 \cup D_2$. Hence,
$U \cap V$ has two components, by the Jordan curve theorem. Furthermore, since $D$ equals
the simple closed curve $a_1 a_3 a_2 a_4 a_1$, the result of (a) implies that the points $x$ and $y$,
which lie interior to the other two edges of the graph $G$, lie in different components of
$S^2 - D$.


Figure 65.4 


The hypotheses of Theorem 63.1 are thus satisfied. The path $\alpha$ is a path in $U$
from $x$ to $y$, while $\beta$ is a path in $V$ from $y$ to $x$. Because the fundamental group of $X$
is infinite cyclic, the loop $\alpha * \beta$ represents a generator of this group. ∎


Now we prove our main theorem. 


Theorem 65.2. Let $C$ be a simple closed curve in $S^2$; let $p$ and $q$ lie in different
components of $S^2 - C$. Then the inclusion mapping $j : C \to S^2 - p - q$ induces an
isomorphism of fundamental groups.


Proof. The proof involves constructing a complete graph on four vertices that
contains $C$ as a subgraph.

Step 1. Let $a$, $b$, and $c$ be three distinct points of $\mathbb{R}^2$. If $A$ is an arc with end
points $a$ and $b$, and if $B$ is an arc with end points $b$ and $c$, then there exists an arc
contained in $A \cup B$ with end points $a$ and $c$.

Choose paths $f : I \to A$ from $a$ to $b$, and $g : I \to B$ from $b$ to $c$, such that $f$
and $g$ are homeomorphisms. Let $t_0$ be the smallest point of $I$ such that $f(t_0) \in B$; and
let $t_1$ be the point of $I$ such that $g(t_1) = f(t_0)$. Then the set $f([0, t_0]) \cup g([t_1, 1])$ is
the required arc. (If $t_0 = 0$ or $t_1 = 1$, one of these sets consists of a single point.) See
Figure 65.5.




Figure 65.5 


Step 2. We show that if $U$ is an open set of $\mathbb{R}^2$, any two points of $U$ that can be
connected by a path in $U$ are the end points of an arc lying in $U$.

If $x, y \in U$, set $x \sim y$ if $x = y$ or if there is an arc in $U$ with end points $x$ and $y$.
The result of Step 1 shows that this is an equivalence relation. The equivalence classes
are open, for if the $\epsilon$-neighborhood of $x$ lies in $U$, it consists of points equivalent to $x$.
Since $U$ is connected, there is only one such equivalence class.


Step 3. Let $C$ be a simple closed curve in $\mathbb{R}^2$. We construct a subspace $G$ of $\mathbb{R}^2$
that is a complete graph on four vertices $a_1, \ldots, a_4$ such that $C$ equals the subgraph
$a_1 a_2 a_3 a_4 a_1$.

For convenience, we assume that $\mathbf{0}$ lies in the bounded component of $\mathbb{R}^2 - C$.
Consider the $x$-axis $\mathbb{R} \times 0$ in $\mathbb{R}^2$; let $a_1$ be the largest point on the negative $x$-axis that
lies in $C$, and let $a_3$ be the smallest point on the positive $x$-axis that lies in $C$. Then the
line segment $a_1 a_3$ lies in the closure of the bounded component of $\mathbb{R}^2 - C$.

Let us write $C$ as the union of two arcs $C_1$ and $C_2$ with end points $a_1$ and $a_3$.
Let $a$ be a point of the unbounded component of $\mathbb{R}^2 - C$. Since $C_1$ and $C_2$ do not
separate $\mathbb{R}^2$, we can choose paths $\alpha : I \to \mathbb{R}^2 - C_1$ and $\beta : I \to \mathbb{R}^2 - C_2$ from $a$
to $\mathbf{0}$; in view of Step 2, we may assume that $\alpha$ and $\beta$ are injective. Let $a_2 = \alpha(t_0)$,
where $t_0$ is the smallest number such that $\alpha(t_0) \in C$; then $a_2$ is a point interior to $C_2$.
Similarly, let $a_4 = \beta(t_1)$, where $t_1$ is the smallest number such that $\beta(t_1) \in C$; then $a_4$
is an interior point of $C_1$. Then $\alpha([0, t_0])$ and $\beta([0, t_1])$ are arcs joining $a$ to $a_2$ and $a_4$,
respectively; by Step 1, their union contains an arc with end points $a_2$ and $a_4$; this arc
intersects $C$ only in these two points. This arc, along with the line segment $a_1 a_3$ and
the curve $C$, forms the desired graph. See Figure 65.6.


Step 4. It follows from the result of Step 3 and the preceding lemma that for some
pair of points $p$, $q$ lying in different components of $S^2 - C$, the inclusion $j : C \to
S^2 - p - q$ induces an isomorphism of fundamental groups. To complete the proof,
we need only show that the same holds for any pair $p$, $q$ of points lying in different
components of $S^2 - C$. For that purpose, it suffices to prove the following:

Let $D$ be a simple closed curve in $\mathbb{R}^2$; suppose $\mathbf{0}$ lies in the bounded component of
$\mathbb{R}^2 - D$. Let $p$ be another point of this component. If inclusion $j : D \to \mathbb{R}^2 - \mathbf{0}$ induces
an isomorphism of fundamental groups, then so does the inclusion $k : D \to \mathbb{R}^2 - p$.

Let $f : \mathbb{R}^2 - p \to \mathbb{R}^2 - \mathbf{0}$ be the homeomorphism $f(x) = x - p$. It suffices to
show that the map


$$D \xrightarrow{\;k\;} \mathbb{R}^2 - p \xrightarrow{\;f\;} \mathbb{R}^2 - \mathbf{0}$$


induces an isomorphism of fundamental groups. Let $\alpha$ be a path in $\mathbb{R}^2 - D$ from $\mathbf{0}$
to $p$, and let $F : D \times I \to \mathbb{R}^2 - \mathbf{0}$ be the map $F(x, t) = x - \alpha(t)$. Then $F$ is a
homotopy between $j$ and $f \circ k$; since $j$ induces an isomorphism, so does $f \circ k$. (See
Corollary 58.5.) ∎

This theorem is a special case of a rather deep theorem of algebraic topology,
concerning the "linking number" of two disjoint subspaces of $S^{m+n+1}$, one
homeomorphic to an $m$-sphere and the other homeomorphic to an $n$-sphere; it is related to
the Alexander duality theorem. (See [Mu], p. 433.) The special case of our theorem is
that of a 0-sphere (i.e., a two-point space) and a 1-sphere (i.e., a simple closed curve)
in $S^2$.


§66 The Cauchy Integral Formula 


One of the central theorems in the study of functions of a complex variable is the one 
concerning the Cauchy integral formula for analytic functions. For the classical ver- 
sion of this theorem, one needs to assume not only the Jordan curve theorem, but also 
the winding-number theorem of the last section. There is, however, a reformulation of 
the Cauchy integral theorem that avoids using these results; this version of the theo- 
rem, although it is rather less natural, is the one now commonly found in texts on the 
subject. 

Since we have the Jordan curve theorem at our disposal, we shall set ourselves the 
task of deriving the Cauchy integral formula in its classical version from the reformu- 
lated version. 

We begin by introducing the notion of “winding number” more formally. 


Definition. Let $f$ be a loop in $\mathbb{R}^2$, and let $a$ be a point not in the image of $f$. Set


$$g(s) = [f(s) - a] / \|f(s) - a\|;$$


then $g$ is a loop in $S^1$. Let $p : \mathbb{R} \to S^1$ be the standard covering map, and let $\tilde{g}$ be a
lifting of $g$ to $\mathbb{R}$. Because $g$ is a loop, the difference $\tilde{g}(1) - \tilde{g}(0)$ is an integer. This
integer is called the winding number of $f$ with respect to $a$, and is denoted $n(f, a)$.

Note that $n(f, a)$ is independent of the choice of the lifting of $g$. For if $\tilde{g}$ is one
lifting of $g$, then uniqueness of liftings implies that any other lifting of $g$ has the form
$\tilde{g}(s) + m$ for some integer $m$.
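The definition can be imitated numerically. The following is a minimal sketch, assuming the loop is given by finitely many sample points taken densely enough that the direction from $a$ turns by less than half a revolution between consecutive samples. It accumulates the signed angle changes, which amounts to building a discrete lifting of $g$, and divides by $2\pi$. The names `winding_number` and `circle` are illustrative, not from the text.

```python
import math


def winding_number(points, a):
    """Approximate n(f, a) for a closed loop sampled at `points`.

    points: list of (x, y) samples of the loop, in order; the loop is closed
            by joining the last sample back to the first.
    a:      a point (x, y) not on the loop.
    Assumes consecutive samples subtend an angle of less than pi at a.
    """
    total = 0.0
    n = len(points)
    for k in range(n):
        x0, y0 = points[k][0] - a[0], points[k][1] - a[1]
        x1, y1 = points[(k + 1) % n][0] - a[0], points[(k + 1) % n][1] - a[1]
        # Signed angle from (x0, y0) to (x1, y1), taken in (-pi, pi].
        total += math.atan2(x0 * y1 - y0 * x1, x0 * x1 + y0 * y1)
    return round(total / (2 * math.pi))


def circle(n, turns=1):
    """n samples of the loop s -> (cos 2*pi*turns*s, sin 2*pi*turns*s)."""
    return [(math.cos(2 * math.pi * turns * k / n),
             math.sin(2 * math.pi * turns * k / n)) for k in range(n)]


print(winding_number(circle(200), (0.0, 0.0)))                  # point inside the circle
print(winding_number(circle(200), (2.0, 0.0)))                  # point outside the circle
print(winding_number(list(reversed(circle(200))), (0.3, 0.1)))  # reversed loop
print(winding_number(circle(200, turns=2), (0.0, 0.0)))         # loop traversed twice
```

Reversing the order of the samples changes the sign of every angle increment, which is the discrete counterpart of replacing $s$ by $1 - s$.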


Definition. Let $F : I \times I \to X$ be a continuous map such that $F(0, t) = F(1, t)$
for all $t$. Then for each $t$, the map $f_t(s) = F(s, t)$ is a loop in $X$. The map $F$ is called
a free homotopy between the loops $f_0$ and $f_1$. It is a homotopy of loops in which the
base point of the loop is allowed to move during the homotopy.


Lemma 66.1. Let $f$ be a loop in $\mathbb{R}^2 - a$.
(a) If $\bar{f}$ is the reverse of $f$, then $n(\bar{f}, a) = -n(f, a)$.
(b) If $f$ is freely homotopic to $f'$, through loops lying in $\mathbb{R}^2 - a$, then $n(f, a) =
n(f', a)$.
(c) If $a$ and $b$ lie in the same component of $\mathbb{R}^2 - f(I)$, then $n(f, a) = n(f, b)$.

Proof. (a) To compute $n(\bar{f}, a)$, one replaces $s$ by $1 - s$ throughout the definition. This
has the effect of changing $\tilde{g}(1) - \tilde{g}(0)$ by a sign.

(b) Let $F$ be a free homotopy between $f$ and $f'$. Define $G : I \times I \to S^1$ by the
equation


$$G(s, t) = [F(s, t) - a] / \|F(s, t) - a\|.$$


Let $\tilde{G}$ be a lifting of $G$ to $\mathbb{R}$. Then $\tilde{G}(1, t) - \tilde{G}(0, t)$ is an integer for each $t$; being
continuous, it is constant.

(c) Let $\alpha$ be a path in $\mathbb{R}^2 - f(I)$ from $a$ to $b$. Note that by definition, $n(f, a) =
n(f - a, \mathbf{0})$. Since $f(s) - \alpha(t)$ is a free homotopy in $\mathbb{R}^2 - \mathbf{0}$ between $f - a$ and $f - b$,
our result follows. ∎


Definition. Let f be a loop in X. We call f a simple loop provided f(s) = f(s’) 
only if s = s’ or if one of the points s, s’ is 0 and the other is 1. If f is a simple loop, 
its image set is a simple closed curve in X. 


Theorem 66.2. Let $f$ be a simple loop in $\mathbb{R}^2$. If $a$ lies in the unbounded component of
$\mathbb{R}^2 - f(I)$, then $n(f, a) = 0$; while if $a$ lies in the bounded component, $n(f, a) = \pm 1$.


Proof. Since $n(f, a) = n(f - a, \mathbf{0})$, we may restrict ourselves to the case $a = \mathbf{0}$.
Furthermore, we may assume that the base point of $f$ lies on the positive $x$-axis. For
one can gradually rotate $\mathbb{R}^2 - \mathbf{0}$ until the base point of $f$ is such a point; this modifies $f$
by a free homotopy, so it does not affect the conclusion of the theorem.

So let $f$ be a simple loop in $X = \mathbb{R}^2 - \mathbf{0}$ based at a point $x_0$ of the positive
$x$-axis. Let $C$ be the simple closed curve $f(I)$. We show that if $\mathbf{0}$ lies in the bounded
component of $\mathbb{R}^2 - C$, then $[f]$ generates $\pi_1(X, x_0)$, while if $\mathbf{0}$ lies in the unbounded
component, $[f]$ is trivial.

The map $f$ induces, via the standard quotient map $p : I \to S^1$, a homeomorphism
$h : S^1 \to C$. The element $[p]$ generates the fundamental group of $S^1$, so $h_*[p]$
generates the fundamental group of $C$. If $\mathbf{0}$ lies in the bounded component of $\mathbb{R}^2 - C$,
Theorem 65.2 tells us that $j_* h_*[p] = [f]$ generates the fundamental group of $\mathbb{R}^2 - \mathbf{0}$,
where $j : C \to \mathbb{R}^2 - \mathbf{0}$ is the inclusion. On the other hand, if $\mathbf{0}$ lies in the unbounded
component of $\mathbb{R}^2 - C$, then $j \circ h$ is nulhomotopic by Lemma 61.2, so that $[f]$ is trivial.

Now we show that if $[f]$ generates $\pi_1(X, x_0)$, then $n(f, \mathbf{0}) = \pm 1$, while if $[f]$
is trivial, $n(f, \mathbf{0}) = 0$. Since the retraction $x \to x/\|x\|$ of $\mathbb{R}^2 - \mathbf{0}$ onto $S^1$ induces
an isomorphism of fundamental groups, the loop $g(s) = f(s)/\|f(s)\|$ represents a
generator of $\pi_1(S^1, b_0)$ in the first case, and the identity element in the second case.
If we examine the isomorphism $\phi : \pi_1(S^1, b_0) \to \mathbb{Z}$ constructed in the proof of
Theorem 54.5, we see this means that when we lift $g$ to a path $\tilde{g}$ in $\mathbb{R}$ beginning at 0,
the path $\tilde{g}$ ends at $\pm 1$ in the first case, and at 0 in the second. ∎


Definition. Let $f$ be a simple loop in $\mathbb{R}^2$. We say $f$ is a counterclockwise loop
if $n(f, a) = +1$ for some $a$ (and hence for every $a$) in the bounded component of
$\mathbb{R}^2 - f(I)$. We say it is a clockwise loop if $n(f, a) = -1$. The standard loop $p(s) =
(\cos 2\pi s, \sin 2\pi s)$ is thus a counterclockwise loop.


Application to complex variables 


We now relate winding numbers to complex line integrals. 


Lemma 66.3. Let $f$ be a piecewise-differentiable loop in the complex plane; let $a$
be a point not in the image of $f$. Then


$$n(f, a) = \frac{1}{2\pi i} \int_f \frac{dz}{z - a}.$$


This equation is often used as the definition of the winding number of $f$.
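Proof. The proof is a simple exercise in computation. Let $p : \mathbb{R} \to S^1$ be the
standard covering map. Let $r(s) = \|f(s) - a\|$ and $g(s) = [f(s) - a]/r(s)$. Let $\tilde{g}$ be
a lifting of $g$ to $\mathbb{R}$. Set $\theta(s) = 2\pi \tilde{g}(s)$. Then $f(s) - a = r(s) \exp(i\theta(s))$, so that


$$\int_f \frac{dz}{z - a} = \int_0^1 \frac{r' e^{i\theta} + i r \theta' e^{i\theta}}{r e^{i\theta}} \, ds$$
$$= \bigl[\log r(s) + i\theta(s)\bigr]_0^1$$
$$= i[\theta(1) - \theta(0)]$$
$$= 2\pi i[\tilde{g}(1) - \tilde{g}(0)]. \qquad ∎$$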
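The integral formula for the winding number can be checked numerically. The following is a minimal sketch, assuming the loop $f$ is differentiable and that its derivative is supplied explicitly; it approximates $\frac{1}{2\pi i}\int_0^1 f'(s)/(f(s) - a)\,ds$ with the midpoint rule. The function name `winding_by_integral` and the choice of test loop are illustrative only.

```python
import cmath
import math


def winding_by_integral(f, df, a, steps=2000):
    """Midpoint-rule approximation of (1 / (2*pi*i)) * integral of dz / (z - a) along f.

    f:  the loop, as a function from [0, 1] to the complex plane
    df: its derivative with respect to s (supplied by the caller)
    a:  a complex number not on the image of f
    """
    h = 1.0 / steps
    total = 0j
    for k in range(steps):
        s = (k + 0.5) * h          # midpoint of the k-th subinterval
        total += df(s) / (f(s) - a) * h
    return total / (2j * math.pi)


# The standard counterclockwise loop p(s) = exp(2*pi*i*s) and its derivative.
def loop(s):
    return cmath.exp(2j * math.pi * s)


def dloop(s):
    return 2j * math.pi * cmath.exp(2j * math.pi * s)


print(winding_by_integral(loop, dloop, 0.3 + 0.2j))  # a inside the circle: close to 1
print(winding_by_integral(loop, dloop, 2.0 + 0.0j))  # a outside the circle: close to 0
```

The result is a complex number whose imaginary part and rounding error are small; it is the real part that approximates the integer $n(f, a)$.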


Theorem 66.4 (Cauchy integral formula, classical version). Let $C$ be a simple
closed piecewise-differentiable curve in the complex plane. Let $B$ be the bounded
component of $\mathbb{R}^2 - C$. If $F(z)$ is analytic in an open set $\Omega$ that contains $B$ and $C$, then
for each point $a$ of $B$,


$$F(a) = \pm \frac{1}{2\pi i} \int_C \frac{F(z)}{z - a} \, dz.$$


The sign is $+$ if $C$ is oriented counterclockwise, and $-$ otherwise.


Proof. We derive this formula from the version of it proved in Ahlfors [A], which is
the following:

Let $F$ be analytic in a region $\Omega$. Let $f$ be a piecewise-differentiable loop in $\Omega$.
Assume that $n(f, b) = 0$ for each $b$ not in $\Omega$. If $a \in \Omega$ and $a$ is not in the image of $f$,
then


$$n(f, a) \cdot F(a) = \frac{1}{2\pi i} \int_f \frac{F(z)}{z - a} \, dz.$$

We apply this result to a piecewise-differentiable parametrization $f$ of our simple
closed curve $C$. The condition $n(f, b) = 0$ holds for each $b$ not in $\Omega$, since any such $b$
lies in the unbounded component of $\mathbb{R}^2 - C$. Furthermore, $n(f, a) = \pm 1$ whenever
$a$ is in $B$, the sign depending on the orientation of $C$, by Theorem 66.2. The theorem
follows. ∎
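A numerical check of the classical formula, including the sign convention, may be helpful. The following is a minimal sketch under stated assumptions: $C$ is taken to be the unit circle, $F$ is the exponential function (analytic everywhere), and the integral is approximated with the midpoint rule. The function name `cauchy_integral` and its `orientation` parameter are hypothetical.

```python
import cmath
import math


def cauchy_integral(F, a, orientation=+1, steps=2000):
    """Approximate (1 / (2*pi*i)) * integral over the unit circle of F(z) / (z - a) dz.

    orientation = +1 traverses the circle counterclockwise, -1 clockwise.
    Assumes |a| < 1 and F analytic on a neighborhood of the closed unit disc.
    """
    h = 1.0 / steps
    total = 0j
    for k in range(steps):
        s = (k + 0.5) * h
        z = cmath.exp(orientation * 2j * math.pi * s)   # point on the circle
        dz = orientation * 2j * math.pi * z * h         # z'(s) ds
        total += F(z) / (z - a) * dz
    return total / (2j * math.pi)


a = 0.2 + 0.1j
print(cauchy_integral(cmath.exp, a, orientation=+1))  # close to exp(a)
print(cauchy_integral(cmath.exp, a, orientation=-1))  # close to -exp(a)
print(cmath.exp(a))
```

With the clockwise orientation the integral changes sign, matching the $\pm$ in the statement of the theorem.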


Note that one cannot even state the classical version of the Cauchy integral theorem 
without knowing the Jordan curve theorem. To prove it requires even more, namely, 
knowledge of the winding number of a simple closed curve. It is of interest to note 
that this latter result can be proved (at least in the differentiable case) by an entirely 
different method, using the general version of Green's Theorem, proved in analysis. 
This proof is outlined in Exercise 2. 


Exercises 


1. Let $f$ be a loop in $\mathbb{R}^2 - a$; let $g(s) = [f(s) - a]/\|f(s) - a\|$. The map $g$ induces,
via the standard quotient map $p : I \to S^1$, a continuous map $h : S^1 \to S^1$.
Show that $n(f, a)$ equals the degree of $h$, as defined in Exercise 9 of §58.


2. This exercise assumes some familiarity with analysis on manifolds.
Theorem. Let $C$ be a simple closed curve in $\mathbb{R}^2$ that is a smooth submanifold
of $\mathbb{R}^2$; let $f : I \to C$ be a simple loop smoothly parameterizing $C$. If $\mathbf{0}$ is a point
of the bounded component of $\mathbb{R}^2 - C$, then $n(f, \mathbf{0}) = \pm 1$.
Proof. Let $U$ be the bounded component of $\mathbb{R}^2 - C$. Let $B$ be a closed $\epsilon$-ball
centered at $\mathbf{0}$ that lies in $U$; let $S = \mathrm{Bd}\, B$. Let $M$ equal the closure of $U - B$.
(a) Show $M$ is a smooth 2-manifold with boundary $C \cup S$.
(b) Apply Green's theorem to show that $\int_C dz/z = \pm \int_S dz/z$, the sign depending
on the orientations of $S$ and $C$. [Hint: Set $P = -y/(x^2 + y^2)$ and
$Q = x/(x^2 + y^2)$.]
(c) Show that the second integral equals $\pm 2\pi i$.


Chapter 11 


The Seifert-van Kampen 
Theorem 


§67 Direct Sums of Abelian Groups 


In this section, we shall consider only groups that are abelian. As is usual, we shall
write such groups additively. Then 0 denotes the identity element of the group, $-x$
denotes the inverse of $x$, and $nx$ denotes the $n$-fold sum $x + \cdots + x$.

Suppose $G$ is an abelian group, and $\{G_\alpha\}_{\alpha \in J}$ is an indexed family of subgroups
of $G$. We say that the groups $G_\alpha$ generate $G$ if every element $x$ of $G$ can be written as
a finite sum of elements of the groups $G_\alpha$. Since $G$ is abelian, we can always rearrange
such a sum to group together terms that belong to a single $G_\alpha$; hence we can always
write $x$ in the form


$$x = x_{\alpha_1} + \cdots + x_{\alpha_n},$$


where the indices $\alpha_i$ are distinct. In this case, we often write $x$ as the formal sum
$x = \sum_{\alpha \in J} x_\alpha$, where it is understood that $x_\alpha = 0$ if $\alpha$ is not one of the indices
$\alpha_1, \ldots, \alpha_n$.

If the groups $G_\alpha$ generate $G$, we often say that $G$ is the sum of the groups $G_\alpha$,
writing $G = \sum_{\alpha \in J} G_\alpha$ in general, or $G = G_1 + \cdots + G_n$ in the case of the finite
index set $\{1, \ldots, n\}$.

Now suppose that the groups $G_\alpha$ generate $G$, and that for each $x \in G$, the expression
$x = \sum x_\alpha$ for $x$ is unique. That is, suppose that for each $x \in G$, there is only one
$J$-tuple $(x_\alpha)_{\alpha \in J}$ with $x_\alpha = 0$ for all but finitely many $\alpha$ such that $x = \sum x_\alpha$. Then $G$
is said to be the direct sum of the groups $G_\alpha$, and we write


$$G = \bigoplus_{\alpha \in J} G_\alpha,$$


or in the finite case, $G = G_1 \oplus \cdots \oplus G_n$.

EXAMPLE 1. The cartesian product $\mathbb{R}^\omega$ is an abelian group under the operation of
coordinate-wise addition. The set $G_n$ consisting of those tuples $(x_i)$ such that $x_i = 0$ for
$i \neq n$ is a subgroup isomorphic to $\mathbb{R}$. The groups $G_n$ generate the subgroup $\mathbb{R}^\infty$ of $\mathbb{R}^\omega$;
indeed, $\mathbb{R}^\infty$ is their direct sum.


A useful characterization of direct sums is given in the following lemma; we call 
it the extension condition for direct sums: 


Lemma 67.1. Let $G$ be an abelian group; let $\{G_\alpha\}$ be a family of subgroups of $G$. If
$G$ is the direct sum of the groups $G_\alpha$, then $G$ satisfies the following condition:


(*) Given any abelian group $H$ and any family of homomorphisms
    $h_\alpha : G_\alpha \to H$, there exists a homomorphism $h : G \to H$ whose
    restriction to $G_\alpha$ equals $h_\alpha$, for each $\alpha$.


Furthermore, $h$ is unique. Conversely, if the groups $G_\alpha$ generate $G$ and the extension
condition (*) holds, then $G$ is the direct sum of the groups $G_\alpha$.


Proof. We show first that if $G$ has the stated extension property, then $G$ is the direct
sum of the $G_\alpha$. Suppose $x = \sum x_\alpha = \sum y_\alpha$; we show that for any particular index $\beta$,
we have $x_\beta = y_\beta$. Let $H$ denote the group $G_\beta$; and let $h_\alpha : G_\alpha \to H$ be the
trivial homomorphism for $\alpha \neq \beta$, and the identity homomorphism for $\alpha = \beta$. Let
$h : G \to H$ be the hypothesized extension of the homomorphisms $h_\alpha$. Then


$$h(x) = \sum h_\alpha(x_\alpha) = x_\beta,$$
$$h(x) = \sum h_\alpha(y_\alpha) = y_\beta,$$


so that $x_\beta = y_\beta$.

Now we show that if $G$ is the direct sum of the $G_\alpha$, then the extension condition
holds. Given homomorphisms $h_\alpha$, we define $h(x)$ as follows: If $x = \sum x_\alpha$, set $h(x) =
\sum h_\alpha(x_\alpha)$. Because this sum is finite, it makes sense; because the expression for $x$ is
unique, $h$ is well-defined. One checks readily that $h$ is the desired homomorphism.
Uniqueness follows by noting that $h$ must satisfy this equation if it is a homomorphism
that equals $h_\alpha$ on $G_\alpha$ for each $\alpha$. ∎
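Two small cases may help fix the idea; both are illustrations chosen here, not examples from the text. The first shows the extension formula at work in a genuine direct sum; the second shows how the extension condition fails when the subgroups generate $G$ but the sum is not direct.

```latex
% Case 1: a direct sum. Take G = Z x Z with G_1 = Z x 0 and G_2 = 0 x Z, and H = Z.
% Given h_1(m, 0) = 2m and h_2(0, n) = 3n, the formula h(x) = \sum h_\alpha(x_\alpha) gives
\[
  h(m, n) \;=\; h_1(m, 0) + h_2(0, n) \;=\; 2m + 3n ,
\]
% and this is the only homomorphism G -> H restricting to h_1 and h_2.

% Case 2: not a direct sum. Take G = Z with G_1 = G_2 = Z (the same subgroup, indexed twice).
% These generate G, but 1 = 1 + 0 = 0 + 1 has two expressions. Let H = Z, let h_1 be the
% identity and h_2 the trivial homomorphism. An extension h would have to satisfy
\[
  h(1) \;=\; h_1(1) \;=\; 1
  \qquad\text{and}\qquad
  h(1) \;=\; h_2(1) \;=\; 0 ,
\]
% which is impossible, so condition (*) fails, as the lemma predicts.
```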


This lemma makes a number of results about direct sums quite easy to prove: 


Corollary 67.2. Let $G = G_1 \oplus G_2$. Suppose $G_1$ is the direct sum of subgroups $H_\alpha$
for $\alpha \in J$, and $G_2$ is the direct sum of subgroups $H_\beta$ for $\beta \in K$, where the index sets $J$
and $K$ are disjoint. Then $G$ is the direct sum of the subgroups $H_\gamma$, for $\gamma \in J \cup K$.

Proof. If $h_\alpha : H_\alpha \to H$ and $h_\beta : H_\beta \to H$ are families of homomorphisms, they
extend to homomorphisms $h_1 : G_1 \to H$ and $h_2 : G_2 \to H$ by the preceding lemma.
Then $h_1$ and $h_2$ extend to a homomorphism $h : G \to H$. ∎


This corollary implies, for example, that 


$$(G_1 \oplus G_2) \oplus G_3 = G_1 \oplus G_2 \oplus G_3 = G_1 \oplus (G_2 \oplus G_3).$$


Corollary 67.3. If $G = G_1 \oplus G_2$, then $G/G_2$ is isomorphic to $G_1$.


Proof. Let $H = G_1$, let $h_1 : G_1 \to H$ be the identity homomorphism, and let
$h_2 : G_2 \to H$ be the trivial homomorphism. Let $h : G \to H$ be their extension to $G$.
Then $h$ is surjective with kernel $G_2$. ∎


In many situations, one is given a family of abelian groups $\{G_\alpha\}$ and one wishes
to find a group $G$ that contains subgroups $G'_\alpha$ isomorphic to the groups $G_\alpha$, such that
$G$ is the direct sum of these subgroups. This can in fact always be done; it leads to a
notion called the external direct sum.


Definition. Let $\{G_\alpha\}_{\alpha \in J}$ be an indexed family of abelian groups. Suppose that $G$ is
an abelian group, and that $i_\alpha : G_\alpha \to G$ is a family of monomorphisms, such that $G$
is the direct sum of the groups $i_\alpha(G_\alpha)$. Then we say that $G$ is the external direct sum
of the groups $G_\alpha$, relative to the monomorphisms $i_\alpha$.


The group G is not unique, of course; we show later that it is unique up to isomor- 
phism. Here is one way of constructing G: 


Theorem 67.4. Given a family of abelian groups $\{G_\alpha\}_{\alpha \in J}$, there exists an abelian
group $G$ and a family of monomorphisms $i_\alpha : G_\alpha \to G$ such that $G$ is the direct sum
of the groups $i_\alpha(G_\alpha)$.


Proof. Consider first the cartesian product

$$\prod_{\alpha \in J} G_\alpha;$$

it is an abelian group if we add two $J$-tuples by adding them coordinate-wise. Let $G$
denote the subgroup of the cartesian product consisting of those tuples $(x_\alpha)_{\alpha \in J}$ such
that $x_\alpha = 0_\alpha$, the identity element of $G_\alpha$, for all but finitely many values of $\alpha$. Given
an index $\beta$, define $i_\beta : G_\beta \to G$ by letting $i_\beta(x)$ be the tuple that has $x$ as its $\beta$th
coordinate and $0_\alpha$ as its $\alpha$th coordinate for all $\alpha \neq \beta$. It is immediate that $i_\beta$ is a
monomorphism. It is also immediate that since each element $x$ of $G$ has only finitely
many nonzero coordinates, $x$ can be written uniquely as a finite sum of elements from
the groups $i_\beta(G_\beta)$. ■
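The construction in this proof can be sketched in code. The following is only an illustration under a simplifying assumption that is not in the text: every group $G_\alpha$ is taken to be the integers, and the index set is any collection of hashable labels. A tuple with finitely many nonzero coordinates is stored as a dictionary, and the function names (`add`, `inject`, `decompose`, `extend`) are hypothetical.

```python
from typing import Callable, Dict, Hashable, List

# Assumption for illustration: every G_alpha is the additive group Z.
# An element of the external direct sum is a dict {index: nonzero coordinate};
# an index that is absent from the dict has coordinate 0.
Element = Dict[Hashable, int]


def add(x: Element, y: Element) -> Element:
    """Coordinate-wise sum; zero coordinates are dropped so the support stays finite."""
    total = dict(x)
    for index, value in y.items():
        total[index] = total.get(index, 0) + value
    return {index: value for index, value in total.items() if value != 0}


def inject(index: Hashable, value: int) -> Element:
    """The map i_beta: put value in coordinate beta and 0 in every other coordinate."""
    return {index: value} if value != 0 else {}


def decompose(x: Element) -> List[Element]:
    """Write x as the finite sum of the elements i_alpha(x_alpha)."""
    return [inject(index, value) for index, value in x.items()]


def extend(homs: Dict[Hashable, Callable[[int], int]], x: Element) -> int:
    """h(x) = sum of h_alpha(x_alpha), here with H = Z as well (an assumption)."""
    return sum(homs[index](value) for index, value in x.items())
```

For instance, `add(inject("a", 2), inject("b", -3))` is the element with coordinate 2 at index `"a"` and -3 at index `"b"`, and `decompose` recovers exactly those two summands.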
The extension condition that characterizes ordinary direct sums translates immediately
into an extension condition for external direct sums:


Lemma 67.5. Let $\{G_\alpha\}_{\alpha \in J}$ be an indexed family of abelian groups; let $G$ be an
abelian group; let $i_\alpha : G_\alpha \to G$ be a family of homomorphisms. If each $i_\alpha$ is a
monomorphism and $G$ is the direct sum of the groups $i_\alpha(G_\alpha)$, then $G$ satisfies the
following extension condition:


    (*) Given any abelian group $H$ and any family of homomorphisms
        $h_\alpha : G_\alpha \to H$, there exists a homomorphism $h : G \to H$ such that
        $h \circ i_\alpha = h_\alpha$ for each $\alpha$.


Furthermore, $h$ is unique. Conversely, suppose the groups $i_\alpha(G_\alpha)$ generate $G$ and the
extension condition (*) holds. Then each $i_\alpha$ is a monomorphism, and $G$ is the direct
sum of the groups $i_\alpha(G_\alpha)$.


Proof. The only part that requires proof is the statement that if the extension condition
holds, then each $i_\alpha$ is a monomorphism. That is proved as follows. Given an
index $\beta$, set $H = G_\beta$ and let $h_\alpha : G_\alpha \to H$ be the identity homomorphism if
$\alpha = \beta$, and the trivial homomorphism if $\alpha \neq \beta$. Let $h : G \to H$ be the hypothesized
extension. Then in particular, $h \circ i_\beta = h_\beta$, which is the identity map of $G_\beta$; it
follows that $i_\beta$ is injective. ■


An immediate consequence is a uniqueness theorem for direct sums: 


Theorem 67.6 (Uniqueness of direct sums). Let $\{G_\alpha\}_{\alpha \in J}$ be a family of abelian
groups. Suppose $G$ and $G'$ are abelian groups and $i_\alpha : G_\alpha \to G$ and $i'_\alpha : G_\alpha \to G'$
are families of monomorphisms, such that $G$ is the direct sum of the groups $i_\alpha(G_\alpha)$
and $G'$ is the direct sum of the groups $i'_\alpha(G_\alpha)$. Then there is a unique isomorphism
$\phi : G \to G'$ such that $\phi \circ i_\alpha = i'_\alpha$ for each $\alpha$.


Proof. We apply the preceding lemma (four times!). Since $G$ is the external direct
sum of the $G_\alpha$ and $\{i'_\alpha\}$ is a family of homomorphisms, there exists a unique
homomorphism $\phi : G \to G'$ such that $\phi \circ i_\alpha = i'_\alpha$ for each $\alpha$. Similarly, since $G'$ is
the external direct sum of the $G_\alpha$ and $\{i_\alpha\}$ is a family of homomorphisms, there exists a
unique homomorphism $\psi : G' \to G$ such that $\psi \circ i'_\alpha = i_\alpha$ for each $\alpha$. Now
$\psi \circ \phi : G \to G$ has the property that $\psi \circ \phi \circ i_\alpha = i_\alpha$ for each $\alpha$; since the
identity map of $G$ has the same property, the uniqueness part of the lemma shows that
$\psi \circ \phi$ must equal the identity map of $G$. Similarly, $\phi \circ \psi$ must equal the identity
map of $G'$. ■


If $G$ is the external direct sum of the groups $G_\alpha$, relative to the monomorphisms $i_\alpha$,
we sometimes abuse notation and write $G = \bigoplus G_\alpha$, even though the groups $G_\alpha$ are
not subgroups of $G$. That is, we identify each group $G_\alpha$ with its image under $i_\alpha$, and
treat $G$ as an ordinary direct sum rather than an external direct sum. In each case, the
context will make the meaning clear.

Now we discuss free abelian groups. 
Definition. Let $G$ be an abelian group and let $\{a_\alpha\}$ be an indexed family of elements
of $G$; let $G_\alpha$ be the subgroup of $G$ generated by $a_\alpha$. If the groups $G_\alpha$ generate $G$, we
also say that the elements $a_\alpha$ generate $G$. If each group $G_\alpha$ is infinite cyclic, and if $G$
is the direct sum of the groups $G_\alpha$, then $G$ is said to be a free abelian group having
the elements $\{a_\alpha\}$ as a basis.


The extension condition for direct sums implies the following extension condition 
for free abelian groups: 


Lemma 67.7. Let $G$ be an abelian group; let $\{a_\alpha\}_{\alpha \in J}$ be a family of elements of $G$
that generates $G$. Then $G$ is a free abelian group with basis $\{a_\alpha\}$ if and only if for any
abelian group $H$ and any family $\{y_\alpha\}$ of elements of $H$, there is a homomorphism $h$
of $G$ into $H$ such that $h(a_\alpha) = y_\alpha$ for each $\alpha$. In such case, $h$ is unique.


Proof. Let $G_\alpha$ denote the subgroup of $G$ generated by $a_\alpha$. Suppose first that the
extension property holds. We show first that each group $G_\alpha$ is infinite cyclic. Suppose
that for some index $\beta$, the element $a_\beta$ generates a finite cyclic subgroup of $G$. Then
if we set $H = \mathbb{Z}$, there is no homomorphism $h : G \to H$ that maps each $a_\alpha$ to the
number $1$. For $a_\beta$ has finite order and $1$ does not! To show that $G$ is the direct sum of
the groups $G_\alpha$, we merely apply Lemma 67.1.

Conversely, if $G$ is free abelian with basis $\{a_\alpha\}$, then given the elements $\{y_\alpha\}$ of
$H$, there are homomorphisms $h_\alpha : G_\alpha \to H$ such that $h_\alpha(a_\alpha) = y_\alpha$ (because $G_\alpha$ is
infinite cyclic). Then Lemma 67.1 applies. ■


Theorem 67.8. If $G$ is a free abelian group with basis $\{a_1, \dots, a_n\}$, then $n$ is uniquely
determined by $G$.


Proof. The group $G$ is isomorphic to the $n$-fold product $\mathbb{Z} \times \cdots \times \mathbb{Z}$; the subgroup $2G$
corresponds to the product $(2\mathbb{Z}) \times \cdots \times (2\mathbb{Z})$. Then the quotient group $G/2G$ is
in bijective correspondence with the set $(\mathbb{Z}/2\mathbb{Z}) \times \cdots \times (\mathbb{Z}/2\mathbb{Z})$, so that $G/2G$ has
cardinality $2^n$. Thus $n$ is uniquely determined by $G$. ■


If G is a free abelian group with a finite basis, the number of elements in a basis 
for G is called the rank of G. 
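The counting step in the proof of Theorem 67.8 can be checked numerically for small $n$. The sketch below is an illustration only: it identifies $G$ with $\mathbb{Z}^n$, samples the integer vectors in a small box (the `bound` parameter is an arbitrary choice for the example), and records which cosets of $2G$ they fall into. The function name `cosets_mod_2` is hypothetical.

```python
from itertools import product


def cosets_mod_2(n: int, bound: int = 3) -> set:
    """Reduce every vector of Z^n with entries in [-bound, bound] modulo 2.

    Two vectors lie in the same coset of 2G exactly when their coordinates
    agree modulo 2, so the set returned lists the cosets that were hit.
    """
    box = product(range(-bound, bound + 1), repeat=n)
    return {tuple(coordinate % 2 for coordinate in vector) for vector in box}
```

With `n = 3` the box already meets every coset, and the set returned has $2^3 = 8$ elements, in agreement with the cardinality computed in the proof.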


Exercises


1. Suppose that $G = \sum G_\alpha$. Show this sum is direct if and only if the equation

$$x_{\alpha_1} + \cdots + x_{\alpha_n} = 0$$

implies that each $x_{\alpha_i}$ equals $0$. (Here $x_{\alpha_i} \in G_{\alpha_i}$, and the indices $\alpha_i$ are distinct.)


2. Show that if $G_1$ is a subgroup of $G$, there may be no subgroup $G_2$ of $G$ such that
$G = G_1 \oplus G_2$. [Hint: Set $G = \mathbb{Z}$ and $G_1 = 2\mathbb{Z}$.]


3. If $G$ is free abelian with basis $\{x, y\}$, show that $\{2x + 3y, x - y\}$ is also a basis
for $G$.


4. The order of an element a of an abelian group G is the smallest positive integer m 
such that ma = 0, if such exists; otherwise, the order of a is said to be infinite. 
The order of a thus equals the order of the subgroup generated by a. 

(a) Show the elements of finite order in G form a subgroup of G, called its 
torsion subgroup. 

(b) Show that if G is free abelian, it has no elements of finite order. 

(c) Show the additive group of rationals has no elements of finite order, but is
not free abelian. [Hint: If $\{a_\alpha\}$ is a basis, express $\tfrac{1}{2} a_\alpha$ in terms of this basis.]


5. Give an example of a free abelian group $G$ of rank $n$ having a subgroup $H$ of
rank $n$ for which $H \neq G$.


6. Prove the following:
Theorem. If $A$ is a free abelian group of rank $n$, then any subgroup $B$ of $A$ is a
free abelian group of rank at most $n$.
Proof. We can assume $A = \mathbb{Z}^n$, the $n$-fold cartesian product of $\mathbb{Z}$ with itself. Let
$\pi_i : \mathbb{Z}^n \to \mathbb{Z}$ be projection on the $i$th coordinate. Given $m \leq n$, let $B_m$ consist
of all elements $x$ of $B$ such that $\pi_i(x) = 0$ for $i > m$. Then $B_m$ is a subgroup
of $B$.

Consider the subgroup $\pi_m(B_m)$ of $\mathbb{Z}$. If this subgroup is nontrivial, choose
$x_m \in B_m$ so that $\pi_m(x_m)$ is a generator of this subgroup. Otherwise, set $x_m = 0$.

(a) Show $\{x_1, \dots, x_m\}$ generates $B_m$, for each $m$.

(b) Show the nonzero elements of $\{x_1, \dots, x_m\}$ form a basis for $B_m$, for each $m$.

(c) Show that $B_n = B$ is free abelian with rank at most $n$.


§68 Free Products of Groups 


We now consider groups $G$ that are not necessarily abelian. In this case, we write $G$
multiplicatively. We denote the identity element of $G$ by $1$, and the inverse of the
element $x$ by $x^{-1}$. The symbol $x^n$ denotes the $n$-fold product of $x$ with itself, $x^{-n}$
denotes the $n$-fold product of $x^{-1}$ with itself, and $x^0$ denotes $1$.

In this section, we study a concept that plays a role for arbitrary groups similar to 
that played by the direct sum for abelian groups. It is called the free product of groups. 

Let $G$ be a group. If $\{G_\alpha\}_{\alpha \in J}$ is a family of subgroups of $G$, we say (as before)
that these groups generate $G$ if every element $x$ of $G$ can be written as a finite product
of elements of the groups $G_\alpha$. This means that there is a finite sequence $(x_1, \dots, x_n)$
of elements of the groups $G_\alpha$ such that $x = x_1 \cdots x_n$. Such a sequence is called a
word (of length $n$) in the groups $G_\alpha$; it is said to represent the element $x$ of $G$.

Note that because we lack commutativity, we cannot rearrange the factors in the
expression for $x$ so as to group together factors that belong to a single one of the groups
$G_\alpha$. However, if $x_i$ and $x_{i+1}$ both belong to the same group $G_\alpha$, we can group them
together, thereby obtaining the word

$$(x_1, \dots, x_{i-1}, x_i x_{i+1}, x_{i+2}, \dots, x_n),$$

of length $n - 1$, which also represents $x$. Furthermore, if any $x_i$ equals $1$, we can
delete $x_i$ from the sequence, again obtaining a shorter word that represents $x$.

Applying these reduction operations repeatedly, one can in general obtain a word
representing $x$ of the form $(y_1, \dots, y_m)$, where no group $G_\alpha$ contains both $y_i$ and $y_{i+1}$,
and where $y_i \neq 1$ for all $i$. Such a word is called a reduced word. This discussion
does not apply, however, if $x$ is the identity element of $G$. For in that case, one might
represent $x$ by a word such as $(a, a^{-1})$, which reduces successively to the word $(a a^{-1})$
of length one, and then disappears altogether! Accordingly, we make the convention
that the empty set is considered to be a reduced word (of length zero) that represents the
identity element of $G$. With this convention, it is true that if the groups $G_\alpha$ generate $G$,
then every element of $G$ can be represented by a reduced word in the elements of the
groups $G_\alpha$.

Note that if $(x_1, \dots, x_n)$ and $(y_1, \dots, y_m)$ are words representing $x$ and $y$, respectively,
then $(x_1, \dots, x_n, y_1, \dots, y_m)$ is a word representing $xy$. Even if the first two
words are reduced words, however, the third will not be a reduced word unless none
of the groups $G_\alpha$ contains both $x_n$ and $y_1$.


Definition. Let $G$ be a group, let $\{G_\alpha\}_{\alpha \in J}$ be a family of subgroups of $G$ that
generates $G$. Suppose that $G_\alpha \cap G_\beta$ consists of the identity element alone whenever
$\alpha \neq \beta$. We say that $G$ is the free product of the groups $G_\alpha$ if for each $x \in G$, there is
only one reduced word in the groups $G_\alpha$ that represents $x$. In this case, we write

$$G = \prod^{*}_{\alpha \in J} G_\alpha,$$

or in the finite case, $G = G_1 * \cdots * G_n$.


Let $G$ be the free product of the groups $G_\alpha$, and let $(x_1, \dots, x_n)$ be a word in the
groups $G_\alpha$ satisfying the condition $x_i \neq 1$ for all $i$. Then, for each $i$, there is a unique
index $\alpha_i$ such that $x_i \in G_{\alpha_i}$; to say the word is a reduced word is to say simply that
$\alpha_i \neq \alpha_{i+1}$ for each $i$.

Suppose the groups $G_\alpha$ generate $G$, where $G_\alpha \cap G_\beta = \{1\}$ for $\alpha \neq \beta$. In order
for $G$ to be the free product of these groups, it suffices to know that the representation
of $1$ by the empty word is unique. For suppose this weaker condition holds, and
suppose that $(x_1, \dots, x_n)$ and $(y_1, \dots, y_m)$ are two reduced words that represent the
same element $x$ of $G$. Let $\alpha_i$ and $\beta_i$ be the indices such that $x_i \in G_{\alpha_i}$ and $y_i \in G_{\beta_i}$.
Since

$$x_1 \cdots x_n = x = y_1 \cdots y_m,$$

the word

$$(y_m^{-1}, \dots, y_1^{-1}, x_1, \dots, x_n)$$

represents $1$. It must be possible to reduce this word, so we must have $\alpha_1 = \beta_1$; the
word then reduces to the word

$$(y_m^{-1}, \dots, y_1^{-1} x_1, \dots, x_n).$$

Again, it must be possible to reduce this word, so we must have $y_1^{-1} x_1 = 1$. Then
$x_1 = y_1$, so that $1$ is represented by the word

$$(y_m^{-1}, \dots, y_2^{-1}, x_2, \dots, x_n).$$

The argument continues similarly. One concludes finally that $m = n$ and $x_i = y_i$ for
all $i$.

EXAMPLE 1. Consider the group $P$ of bijections of the set $\{0, 1, 2\}$ with itself. For
$i = 1, 2$, define an element $\pi_i$ of $P$ by setting $\pi_i(i) = i - 1$ and $\pi_i(i - 1) = i$ and
$\pi_i(j) = j$ otherwise. Then $\pi_i$ generates a subgroup $G_i$ of $P$ of order 2. The groups $G_1$
and $G_2$ generate $P$, as you can check. But $P$ is not their free product. The reduced words
$(\pi_1, \pi_2, \pi_1)$ and $(\pi_2, \pi_1, \pi_2)$, for instance, represent the same element of $P$.
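The claims of Example 1 are small enough to verify by direct computation. The sketch below is one possible check, not part of the text: a bijection of $\{0, 1, 2\}$ is stored as a tuple of images, and the helper names (`compose`, `transposition`, `evaluate`, `generated_subgroup`) are hypothetical.

```python
from functools import reduce


def compose(p, q):
    """Return the bijection p o q (apply q first); a bijection is a tuple of images."""
    return tuple(p[q[j]] for j in range(len(q)))


def transposition(i, size=3):
    """The element pi_i of Example 1: it swaps i - 1 and i and fixes everything else."""
    images = list(range(size))
    images[i - 1], images[i] = i, i - 1
    return tuple(images)


def evaluate(word, size=3):
    """Multiply out a word (a sequence of bijections) to the element it represents."""
    identity = tuple(range(size))
    return reduce(compose, word, identity)


def generated_subgroup(generators, size=3):
    """Close the generators under composition; in a finite group this is a subgroup."""
    identity = tuple(range(size))
    elements = {identity}
    frontier = [identity]
    while frontier:
        current = frontier.pop()
        for g in generators:
            candidate = compose(current, g)
            if candidate not in elements:
                elements.add(candidate)
                frontier.append(candidate)
    return elements


pi1, pi2 = transposition(1), transposition(2)
```

Evaluating the two reduced words `(pi1, pi2, pi1)` and `(pi2, pi1, pi2)` gives the same bijection, the one that swaps 0 and 2, and `generated_subgroup([pi1, pi2])` contains all six bijections.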


The free product satisfies an extension condition analogous to that satisfied by the 
direct sum: 


Lemma 68.1. Let $G$ be a group; let $\{G_\alpha\}$ be a family of subgroups of $G$. If $G$ is the
free product of the groups $G_\alpha$, then $G$ satisfies the following condition:


    (*) Given any group $H$ and any family of homomorphisms $h_\alpha : G_\alpha \to H$,
        there exists a homomorphism $h : G \to H$ whose restriction to $G_\alpha$
        equals $h_\alpha$, for each $\alpha$.


Furthermore, $h$ is unique.


The converse of this lemma holds, but the proof is not as easy as it was for direct 
sums. We postpone it until later. 

Proof. Given $x \in G$ with $x \neq 1$, let $(x_1, \dots, x_n)$ be the reduced word that represents
$x$. If $h$ exists, it must satisfy the equation

$$(*) \qquad h(x) = h(x_1) \cdots h(x_n) = h_{\alpha_1}(x_1) \cdots h_{\alpha_n}(x_n),$$

where $\alpha_i$ is the index such that $x_i \in G_{\alpha_i}$. Hence $h$ is unique.

To show $h$ exists, we define it by equation (*) if $x \neq 1$, and we set $h(1) = 1$.
Because the representation of $x$ by a reduced word is unique, $h$ is well-defined. We
must show it is a homomorphism.

We first prove a preliminary result. Given a word $w = (x_1, \dots, x_n)$ of positive
length in the elements of the groups $G_\alpha$, let us define $\phi(w)$ to be the element of $H$
given by the equation

$$(**) \qquad \phi(w) = h_{\alpha_1}(x_1) \cdots h_{\alpha_n}(x_n),$$

where $\alpha_i$ is any index such that $x_i \in G_{\alpha_i}$. Now $\alpha_i$ is unique unless $x_i = 1$; hence $\phi$
is well-defined. If $w$ is the empty word, let $\phi(w)$ equal the identity element of $H$. We
show that if $w'$ is a word obtained from $w$ by applying one of our reduction operations,
$\phi(w') = \phi(w)$.

Suppose first that $w'$ is obtained by deleting $x_i = 1$ from the word $w$. Then the
equation $\phi(w') = \phi(w)$ follows from the fact that $h_{\alpha_i}(x_i) = 1$. Second, suppose that
$\alpha_i = \alpha_{i+1}$ and that

$$w' = (x_1, \dots, x_i x_{i+1}, \dots, x_n).$$

The fact that

$$h_\alpha(x_i) h_\alpha(x_{i+1}) = h_\alpha(x_i x_{i+1}),$$

where $\alpha = \alpha_i = \alpha_{i+1}$, implies that $\phi(w) = \phi(w')$.

It follows at once that if $w$ is any word in the groups $G_\alpha$ that represents $x$, then
$h(x) = \phi(w)$. For by definition of $h$, this equation holds for any reduced word $w$; and
the process of reduction does not change the value of $\phi$.


Now we show that $h$ is a homomorphism. Suppose that $w = (x_1, \dots, x_n)$ and
$w' = (y_1, \dots, y_m)$ are words representing $x$ and $y$, respectively. Let $(w, w')$ denote
the word $(x_1, \dots, x_n, y_1, \dots, y_m)$, which represents $xy$. It follows from equation (**)
that $\phi(w, w') = \phi(w)\phi(w')$. Then $h(xy) = h(x)h(y)$. ■


We now consider the problem of taking an arbitrary family of groups $\{G_\alpha\}$ and
finding a group $G$ that contains subgroups $G'_\alpha$ isomorphic to the groups $G_\alpha$, such that
$G$ is the free product of the groups $G'_\alpha$. This can, in fact, be done; it leads to the notion
of external free product.


Definition. Let $\{G_\alpha\}_{\alpha \in J}$ be an indexed family of groups. Suppose that $G$ is a group,
and that $i_\alpha : G_\alpha \to G$ is a family of monomorphisms, such that $G$ is the free product of
the groups $i_\alpha(G_\alpha)$. Then we say that $G$ is the external free product of the groups $G_\alpha$,
relative to the monomorphisms $i_\alpha$.


The group G is not unique, of course; we show later that it is unique up to iso- 
morphism. Constructing G is much more difficult than constructing the external direct 
sum was: 


Theorem 68.2. Given a family $\{G_\alpha\}_{\alpha \in J}$ of groups, there exists a group $G$ and a
family of monomorphisms $i_\alpha : G_\alpha \to G$ such that $G$ is the free product of the groups
$i_\alpha(G_\alpha)$.


Proof. For convenience, we assume that the groups $G_\alpha$ are disjoint as sets. (This can
be accomplished by replacing $G_\alpha$ by $G_\alpha \times \{\alpha\}$ for each index $\alpha$, if necessary.)

Then as before, we define a word (of length $n$) in the elements of the groups $G_\alpha$
to be an $n$-tuple $w = (x_1, \dots, x_n)$ of elements of $\bigcup G_\alpha$. It is called a reduced word
if $\alpha_i \neq \alpha_{i+1}$ for all $i$, where $\alpha_i$ is the index such that $x_i \in G_{\alpha_i}$, and if for each $i$, $x_i$
is not the identity element of $G_{\alpha_i}$. We define the empty set to be the unique reduced
word of length zero. Note that we are not given a group $G$ that contains all the $G_\alpha$ as
subgroups, so we cannot speak of a word “representing” an element of $G$.

Let $W$ denote the set of all reduced words in the elements of the groups $G_\alpha$. Let
$P(W)$ denote the set of all bijective functions $\pi : W \to W$. Then $P(W)$ is itself
a group, with composition of functions as the group operation. We shall obtain our
desired group $G$ as a subgroup of $P(W)$.

Step 1. For each index $\alpha$ and each $x \in G_\alpha$, we define a set map $\pi_x : W \to W$. It
will satisfy the following conditions:

(1) If $x = 1_\alpha$, the identity element of $G_\alpha$, then $\pi_x$ is the identity map of $W$.
(2) If $x, y \in G_\alpha$ and $z = xy$, then $\pi_z = \pi_x \circ \pi_y$.

We proceed as follows: Let $x \in G_\alpha$. For notational purposes, let $w = (x_1, \dots, x_n)$
denote the general nonempty element of $W$, and let $\alpha_1$ denote the index such that
$x_1 \in G_{\alpha_1}$. If $x \neq 1_\alpha$, define $\pi_x$ as follows:

(i) $\pi_x(\emptyset) = (x)$,
(ii) $\pi_x(w) = (x, x_1, \dots, x_n)$ if $\alpha_1 \neq \alpha$,
(iii) $\pi_x(w) = (x x_1, x_2, \dots, x_n)$ if $\alpha_1 = \alpha$ and $x_1 \neq x^{-1}$,
(iv) $\pi_x(w) = (x_2, \dots, x_n)$ if $\alpha_1 = \alpha$ and $x_1 = x^{-1}$.

If $x = 1_\alpha$, define $\pi_x$ to be the identity map of $W$.

Note that the value of $\pi_x$ is in each case a reduced word, that is, an element of $W$.
In cases (i) and (ii), the action of $\pi_x$ increases the length of the word; in case (iii) it
leaves the length unchanged, and in case (iv) it reduces the length of the word. When
case (iv) applies to a word $w$ of length one, it maps $w$ to the empty word.
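The four cases of Step 1 translate directly into a short function. The sketch below is an illustration under an assumption that is not in the text: each group $G_\alpha$ is a finite cyclic group $\mathbb{Z}/m_\alpha$, written additively in the code, so that the product $x x_1$ becomes a sum modulo $m_\alpha$. A letter is a pair `(alpha, x)`, a reduced word is a tuple of letters, and the names `pi` and `reduced_words` are hypothetical.

```python
def pi(letter, word, orders):
    """Apply pi_x to a reduced word, following cases (i)-(iv) of Step 1.

    letter = (alpha, x) with x an integer taken modulo orders[alpha];
    word is a tuple of such letters and is assumed to be reduced.
    """
    alpha, x = letter
    x %= orders[alpha]
    if x == 0:                        # x is the identity: pi_x is the identity map
        return word
    if not word:                      # case (i): the empty word
        return ((alpha, x),)
    alpha1, x1 = word[0]
    if alpha1 != alpha:               # case (ii): prepend x
        return ((alpha, x),) + word
    combined = (x + x1) % orders[alpha]
    if combined != 0:                 # case (iii): merge x with x_1
        return ((alpha, combined),) + word[1:]
    return word[1:]                   # case (iv): x cancels x_1


def reduced_words(orders, max_length):
    """List every reduced word of length at most max_length."""
    letters = [(a, v) for a, m in orders.items() for v in range(1, m)]
    words = [()]
    layer = [()]
    for _ in range(max_length):
        layer = [w + (l,) for w in layer for l in letters
                 if not w or w[-1][0] != l[0]]
        words.extend(layer)
    return words
```

With `orders = {"a": 2, "b": 3}`, one can loop over `reduced_words(orders, 3)` and confirm on each word that applying `pi` for $y$ and then for $x$ agrees with applying it once for $x + y$, which is condition (2) in this additive notation.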


Step 2. We show that if $x, y \in G_\alpha$ and $z = xy$, then $\pi_z = \pi_x \circ \pi_y$.

The result is trivial if either $x$ or $y$ equals $1_\alpha$, since in that case $\pi_x$ or $\pi_y$ is the
identity map. So let us assume henceforth that $x \neq 1_\alpha$ and $y \neq 1_\alpha$. We compute the
values of $\pi_z$ and of $\pi_x \circ \pi_y$ on the reduced word $w$. There are four cases to consider.

(i) Suppose $w$ is the empty word. We have $\pi_y(\emptyset) = (y)$. If $z = 1_\alpha$, then $y = x^{-1}$
and $\pi_x \pi_y(\emptyset) = \emptyset$ by (iv), while $\pi_z(\emptyset)$ equals the same thing because $\pi_z$ is the
identity map. If $z \neq 1_\alpha$, then

$$\pi_x \pi_y(\emptyset) = (xy) = (z) = \pi_z(\emptyset).$$

In the remaining cases, we assume $w = (x_1, \dots, x_n)$, with $x_1 \in G_{\alpha_1}$.

(ii) Suppose $\alpha \neq \alpha_1$. Then $\pi_y(w) = (y, x_1, \dots, x_n)$. If $z = 1_\alpha$, then $y = x^{-1}$
and $\pi_x \pi_y(w) = (x_1, \dots, x_n)$ by (iv), while $\pi_z(w)$ equals the same because $\pi_z$ is the
identity map. If $z \neq 1_\alpha$, then

$$\pi_x \pi_y(w) = (xy, x_1, \dots, x_n) = (z, x_1, \dots, x_n) = \pi_z(w).$$

(iii) Suppose $\alpha = \alpha_1$ and $y x_1 \neq 1_\alpha$. Then $\pi_y(w) = (y x_1, x_2, \dots, x_n)$. If
$x y x_1 = 1_\alpha$, then $\pi_x \pi_y(w) = (x_2, \dots, x_n)$, while $\pi_z(w)$ equals the same thing because
$z x_1 = x y x_1 = 1_\alpha$. If $x y x_1 \neq 1_\alpha$, then

$$\pi_x \pi_y(w) = (x y x_1, x_2, \dots, x_n) = (z x_1, x_2, \dots, x_n) = \pi_z(w).$$

(iv) Finally, suppose $\alpha = \alpha_1$ and $y x_1 = 1_\alpha$. Then $\pi_y(w) = (x_2, \dots, x_n)$, which is
empty if $n = 1$. We compute

$$\pi_x \pi_y(w) = (x, x_2, \dots, x_n) = (x(y x_1), x_2, \dots, x_n) = (z x_1, x_2, \dots, x_n) = \pi_z(w).$$


Step 3. The map $\pi_x$ is an element of $P(W)$, and the map $i_\alpha : G_\alpha \to P(W)$ defined
by $i_\alpha(x) = \pi_x$ is a monomorphism.

To show that $\pi_x$ is bijective, we note that if $y = x^{-1}$, then conditions (1) and (2)
imply that $\pi_y \circ \pi_x$ and $\pi_x \circ \pi_y$ equal the identity map of $W$. Hence $\pi_x$ belongs to $P(W)$.
The fact that $i_\alpha$ is a homomorphism is a consequence of condition (2). To show that $i_\alpha$
is a monomorphism, we note that if $x \neq 1_\alpha$, then $\pi_x(\emptyset) = (x)$, so that $\pi_x$ is not the
identity map of $W$.

Step 4. Let $G$ be the subgroup of $P(W)$ generated by the groups $G'_\alpha = i_\alpha(G_\alpha)$.
We show that $G$ is the free product of the groups $G'_\alpha$.

First, we show that $G'_\alpha \cap G'_\beta$ consists of the identity alone if $\alpha \neq \beta$. Let $x \in G_\alpha$
and $y \in G_\beta$; we suppose that neither $\pi_x$ nor $\pi_y$ is the identity map of $W$ and show that
$\pi_x \neq \pi_y$. But this is easy, for $\pi_x(\emptyset) = (x)$ and $\pi_y(\emptyset) = (y)$, and these are different
words.

Second, we show that no nonempty reduced word

$$w' = (\pi_{x_1}, \dots, \pi_{x_n})$$

in the groups $G'_\alpha$ represents the identity element of $G$. Let $\alpha_i$ be the index such that
$x_i \in G_{\alpha_i}$; then $\alpha_i \neq \alpha_{i+1}$ and $x_i \neq 1_{\alpha_i}$ for each $i$. We compute

$$\pi_{x_1}(\pi_{x_2}(\cdots(\pi_{x_n}(\emptyset)))) = (x_1, \dots, x_n),$$

so the element of $G$ represented by $w'$ is not the identity element of $P(W)$. ■


Although this proof of the existence of free products is certainly correct, it has the 
disadvantage that it doesn’t provide us with a convenient way of thinking about the 
elements of the free product. For many purposes this doesn’t matter, for the extension 
condition is the crucial property that is used in the applications. Nevertheless, one 
would be more comfortable having a more concrete model for the free product. 

For the external direct sum, one had such a model. The external direct sum of
the abelian groups $G_\alpha$ consisted of those elements $(x_\alpha)$ of the cartesian product $\prod G_\alpha$
such that $x_\alpha = 0_\alpha$ for all but finitely many $\alpha$. And each group $G_\beta$ was isomorphic to
the subgroup $G'_\beta$ consisting of those $(x_\alpha)$ such that $x_\alpha = 0_\alpha$ for all $\alpha \neq \beta$.

Is there a similar simple model for the free product? Yes. In the last step of the
preceding proof, we showed that if $(\pi_{x_1}, \dots, \pi_{x_n})$ is a reduced word in the groups $G'_\alpha$,
then

$$\pi_{x_1}(\pi_{x_2}(\cdots(\pi_{x_n}(\emptyset)))) = (x_1, \dots, x_n).$$

This equation implies that if $\pi$ is any element of $P(W)$ belonging to the free product
$G$, then the assignment $\pi \mapsto \pi(\emptyset)$ defines a bijective correspondence between $G$
and the set $W$ itself! Furthermore, if $\pi$ and $\pi'$ are two elements of $G$ such that

$$\pi(\emptyset) = (x_1, \dots, x_n) \quad \text{and} \quad \pi'(\emptyset) = (y_1, \dots, y_k),$$

then $\pi(\pi'(\emptyset))$ is the word obtained by taking the word $(x_1, \dots, x_n, y_1, \dots, y_k)$ and
reducing it!

This gives us a way of thinking about the group $G$. One can think of $G$ as being
simply the set $W$ itself, with the product of two words obtained by juxtaposing them
and reducing the result. The identity element corresponds to the empty word. And
each group $G'_\beta$ corresponds to the subset of $W$ consisting of the empty set and all
words of length 1 of the form $(x)$, for $x \in G_\beta$ and $x \neq 1_\beta$.
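The "juxtapose and reduce" description of the product can be sketched as follows. This is an illustration only, under the assumption (not in the text) that each $G_\alpha$ is a finite cyclic group $\mathbb{Z}/m_\alpha$ written additively in the code; a letter is a pair `(alpha, x)` and the names `reduce_word`, `multiply`, and `inverse` are hypothetical.

```python
def reduce_word(word, orders):
    """Reduce a word: merge neighbouring letters of one group, drop identity letters."""
    stack = []
    for alpha, x in word:
        x %= orders[alpha]
        if x == 0:
            continue                              # an identity letter is deleted
        if stack and stack[-1][0] == alpha:
            _, top = stack.pop()
            combined = (top + x) % orders[alpha]  # group the two letters together
            if combined != 0:
                stack.append((alpha, combined))
        else:
            stack.append((alpha, x))
    return tuple(stack)


def multiply(u, v, orders):
    """Product of two reduced words: juxtapose them and reduce the result."""
    return reduce_word(u + v, orders)


def inverse(u, orders):
    """Inverse of a reduced word: reverse the letters and invert each one."""
    return tuple((alpha, (-x) % orders[alpha]) for alpha, x in reversed(u))
```

With `orders = {"a": 2, "b": 3}` and `u = (("a", 1), ("b", 1))`, the product `multiply(u, inverse(u, orders), orders)` is the empty word, while repeated products of `u` with itself keep growing in length, which is consistent with elements of even length having infinite order.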

An immediate question arises: Why didn’t we use this notion as our definition of 
the free product? It certainly seems simpler than going by way of the group P(W) 
of permutations of W. The answer is this: Verification of the group axioms is very 
difficult if one uses this as the definition; associativity in particular is horrendous. The 
preceding proof of the existence of free products is a model of simplicity and elegance 
by comparison! 

The extension condition for ordinary free products translates immediately into an 
extension condition for external free products: 


Lemma 68.3. Let $\{G_\alpha\}$ be a family of groups; let $G$ be a group; let $i_\alpha : G_\alpha \to G$ be
a family of homomorphisms. If each $i_\alpha$ is a monomorphism and $G$ is the free product
of the groups $i_\alpha(G_\alpha)$, then $G$ satisfies the following condition:


    (*) Given a group $H$ and a family of homomorphisms $h_\alpha : G_\alpha \to H$,
        there exists a homomorphism $h : G \to H$ such that $h \circ i_\alpha = h_\alpha$ for
        each $\alpha$.

Furthermore, $h$ is unique.


An immediate consequence is a uniqueness theorem for free products; the proof is 
very similar to the corresponding proof for direct sums and is left to the reader. 


Theorem 68.4 (Uniqueness of free products). Let $\{G_\alpha\}_{\alpha \in J}$ be a family of groups.
Suppose $G$ and $G'$ are groups and $i_\alpha : G_\alpha \to G$ and $i'_\alpha : G_\alpha \to G'$ are families
of monomorphisms, such that the families $\{i_\alpha(G_\alpha)\}$ and $\{i'_\alpha(G_\alpha)\}$ generate $G$ and $G'$,
respectively. If both $G$ and $G'$ have the extension property stated in the preceding
lemma, then there is a unique isomorphism $\phi : G \to G'$ such that $\phi \circ i_\alpha = i'_\alpha$ for all $\alpha$.


Now, finally, we can prove that the extension condition characterizes free products,
proving the converses of Lemmas 68.1 and 68.3.


Lemma 68.5. Let $\{G_\alpha\}_{\alpha \in J}$ be a family of groups; let $G$ be a group; let $i_\alpha : G_\alpha \to G$
be a family of homomorphisms. If the extension condition of Lemma 68.3 holds, then
each $i_\alpha$ is a monomorphism and $G$ is the free product of the groups $i_\alpha(G_\alpha)$.


Proof. We first show that each $i_\alpha$ is a monomorphism. Given an index $\beta$, let us set
$H = G_\beta$. Let $h_\alpha : G_\alpha \to H$ be the identity if $\alpha = \beta$, and the trivial homomorphism
if $\alpha \neq \beta$. Let $h : G \to H$ be the homomorphism given by the extension condition.
Then $h \circ i_\beta = h_\beta$, so that $i_\beta$ is injective.

By Theorem 68.2, there exists a group $G'$ and a family $i'_\alpha : G_\alpha \to G'$ of monomorphisms
such that $G'$ is the free product of the groups $i'_\alpha(G_\alpha)$. Both $G$ and $G'$ have the
extension property of Lemma 68.3. The preceding theorem then implies that there is
an isomorphism $\phi : G \to G'$ such that $\phi \circ i_\alpha = i'_\alpha$. It follows at once that $G$ is the
free product of the groups $i_\alpha(G_\alpha)$. ■


We now prove two results analogous to Corollaries 67.2 and 67.3. 


Corollary 68.6. Let $G = G_1 * G_2$, where $G_1$ is the free product of the subgroups
$\{H_\alpha\}_{\alpha \in J}$ and $G_2$ is the free product of the subgroups $\{H_\beta\}_{\beta \in K}$. If the index sets $J$
and $K$ are disjoint, then $G$ is the free product of the subgroups $\{H_\gamma\}_{\gamma \in J \cup K}$.


Proof. The proof is almost a copy of the proof of Corollary 67.2. ■


This result implies in particular that

$$G_1 * G_2 * G_3 = G_1 * (G_2 * G_3) = (G_1 * G_2) * G_3.$$


In order to state the next theorem, we must recall some terminology from group
theory. If $x$ and $y$ are elements of a group $G$, we say that $y$ is conjugate to $x$ if
$y = c x c^{-1}$ for some $c \in G$. A normal subgroup of $G$ is one that contains all conjugates of
its elements.

If $S$ is a subset of $G$, one can consider the intersection $N$ of all normal subgroups
of $G$ that contain $S$. It is easy to see that $N$ is itself a normal subgroup of $G$; it is called
the least normal subgroup of $G$ that contains $S$.


Theorem 68.7. Let $G = G_1 * G_2$. Let $N_i$ be a normal subgroup of $G_i$, for $i = 1, 2$.
If $N$ is the least normal subgroup of $G$ that contains $N_1$ and $N_2$, then

$$G/N \cong (G_1/N_1) * (G_2/N_2).$$


Proof. The composite of the inclusion and projection homomorphisms

$$G_1 \to G_1 * G_2 \to (G_1 * G_2)/N$$

carries $N_1$ to the identity element, so that it induces a homomorphism

$$i_1 : G_1/N_1 \to (G_1 * G_2)/N.$$

Similarly, the composite of the inclusion and projection homomorphisms induces a
homomorphism

$$i_2 : G_2/N_2 \to (G_1 * G_2)/N.$$

We show that the extension condition of Lemma 68.5 holds with respect to $i_1$ and $i_2$;
it follows that $i_1$ and $i_2$ are monomorphisms and that $(G_1 * G_2)/N$ is the external free
product of $G_1/N_1$ and $G_2/N_2$ relative to these monomorphisms.

So let $h_1 : G_1/N_1 \to H$ and $h_2 : G_2/N_2 \to H$ be arbitrary homomorphisms.
The extension condition for $G_1 * G_2$ implies that there is a homomorphism of $G_1 * G_2$
into $H$ that equals the composite

$$G_i \to G_i/N_i \to H$$

of the projection map and $h_i$ on $G_i$, for $i = 1, 2$. This homomorphism carries the
elements of $N_1$ and $N_2$ to the identity element, so its kernel contains $N$. Therefore
it induces a homomorphism $h : (G_1 * G_2)/N \to H$ that satisfies the conditions
$h_1 = h \circ i_1$ and $h_2 = h \circ i_2$. ■


Corollary 68.8. If $N$ is the least normal subgroup of $G_1 * G_2$ that contains $G_1$, then
$(G_1 * G_2)/N \cong G_2$.


The notion of “least normal subgroup” is a concept that will appear frequently as 
we proceed. Obviously, if N is the least normal subgroup of G containing the subset S 
of G, then N contains S and all conjugates of elements of S. For later use, we now 
verify that these elements actually generate N. 


Lemma 68.9. Let S be a subset of the group G. If N is the least normal subgroup 
of G containing S, then N is generated by all conjugates of elements of S. 


Proof. Let $N'$ be the subgroup of $G$ generated by all conjugates of elements of $S$.
We know that $N' \subset N$; to verify the reverse inclusion, we need merely show that $N'$
is normal in $G$. Given $x \in N'$ and $c \in G$, we show that $c x c^{-1} \in N'$.

We can write $x$ in the form $x = x_1 x_2 \cdots x_n$, where each $x_i$ is conjugate to an
element $s_i$ of $S$. Then $c x_i c^{-1}$ is also conjugate to $s_i$. Because

$$c x c^{-1} = (c x_1 c^{-1})(c x_2 c^{-1}) \cdots (c x_n c^{-1}),$$

$c x c^{-1}$ is a product of conjugates of elements of $S$, so that $c x c^{-1} \in N'$, as desired. ■
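Lemma 68.9 gives a practical recipe for computing a least normal subgroup in a finite group: collect all conjugates of the elements of $S$ and close them under multiplication. The sketch below tries this in the group of permutations of $\{0, \dots, \text{size} - 1\}$; that choice of ambient group, and the helper names (`conjugate`, `generated_subgroup`, `normal_closure`), are assumptions made for the example.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation p o q (apply q first); permutations are tuples of images."""
    return tuple(p[q[j]] for j in range(len(q)))


def inverse(p):
    """Return the inverse permutation of p."""
    result = [0] * len(p)
    for j, image in enumerate(p):
        result[image] = j
    return tuple(result)


def conjugate(c, x):
    """Return c x c^{-1}."""
    return compose(compose(c, x), inverse(c))


def generated_subgroup(generators, size):
    """Close the generators under composition; in a finite group this is a subgroup."""
    identity = tuple(range(size))
    elements = {identity}
    frontier = [identity]
    while frontier:
        current = frontier.pop()
        for g in generators:
            candidate = compose(current, g)
            if candidate not in elements:
                elements.add(candidate)
                frontier.append(candidate)
    return elements


def normal_closure(subset, size):
    """Subgroup generated by all conjugates of elements of subset, as in Lemma 68.9."""
    group = list(permutations(range(size)))
    conjugates = {conjugate(c, s) for c in group for s in subset}
    return generated_subgroup(conjugates, size)
```

For example, with `size = 4` the closure of the single permutation `(1, 0, 3, 2)` has four elements, and one can confirm by brute force that it is closed under conjugation by every permutation.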
Exercises


1. Check the details of Example 1.

2. Let $G = G_1 * G_2$, where $G_1$ and $G_2$ are nontrivial groups.

(a) Show $G$ is not abelian.

(b) If $x \in G$, define the length of $x$ to be the length of the unique reduced word
in the elements of $G_1$ and $G_2$ that represents $x$. Show that if $x$ has even
length (at least 2), then $x$ does not have finite order. Show that if $x$ has odd
length, then $x$ is conjugate to an element of shorter length.

(c) Show that the only elements of $G$ that have finite order are the elements
of $G_1$ and $G_2$ that have finite order, and their conjugates.

3. Let $G = G_1 * G_2$. Given $c \in G$, let $c G_1 c^{-1}$ denote the set of all elements of
the form $c x c^{-1}$, for $x \in G_1$. It is a subgroup of $G$; show that its intersection
with $G_2$ consists of the identity alone.


4. Prove Theorem 68.4. 
§69 Free Groups


Let $G$ be a group; let $\{a_\alpha\}$ be a family of elements of $G$, for $\alpha \in J$. We say the
elements $\{a_\alpha\}$ generate $G$ if every element of $G$ can be written as a product of powers
of the elements $a_\alpha$. If the family $\{a_\alpha\}$ is finite, we say $G$ is finitely generated.


Definition. Let $\{a_\alpha\}$ be a family of elements of a group $G$. Suppose each $a_\alpha$ generates
an infinite cyclic subgroup $G_\alpha$ of $G$. If $G$ is the free product of the groups $\{G_\alpha\}$, then
$G$ is said to be a free group, and the family $\{a_\alpha\}$ is called a system of free generators
for $G$.


In this case, for each element $x$ of $G$, there is a unique reduced word in the elements
of the groups $G_\alpha$ that represents $x$. This says that if $x \neq 1$, then $x$ can be written
uniquely in the form

$$x = (a_{\alpha_1})^{n_1} \cdots (a_{\alpha_k})^{n_k},$$

where $\alpha_i \neq \alpha_{i+1}$ and $n_i \neq 0$ for each $i$. (Of course, $n_i$ may be negative.)
Free groups are characterized by the following extension property: 


Lemma 69.1. Let $G$ be a group; let $\{a_\alpha\}_{\alpha \in J}$ be a family of elements of $G$. If $G$
is a free group with system of free generators $\{a_\alpha\}$, then $G$ satisfies the following
condition:


    (*) Given any group $H$ and any family $\{y_\alpha\}$ of elements of $H$, there is a
        homomorphism $h : G \to H$ such that $h(a_\alpha) = y_\alpha$ for each $\alpha$.


Furthermore, $h$ is unique. Conversely, if the extension condition (*) holds, then $G$ is a
free group with system of free generators $\{a_\alpha\}$.


Proof. If $G$ is free, then for each $\alpha$, the group $G_\alpha$ generated by $a_\alpha$ is infinite cyclic,
so there is a homomorphism $h_\alpha : G_\alpha \to H$ with $h_\alpha(a_\alpha) = y_\alpha$. Then Lemma 68.1
applies. To prove the converse, let $\beta$ be a fixed index. By hypothesis, there exists a
homomorphism $h : G \to \mathbb{Z}$ such that $h(a_\beta) = 1$ and $h(a_\alpha) = 0$ for $\alpha \neq \beta$. It follows
that the group $G_\beta$ is infinite cyclic. Then Lemma 68.5 applies. ■
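The extension property of Lemma 69.1 can be illustrated concretely. In the sketch below, an element of a free group is stored as a sequence of pairs `(generator, exponent)`, and the target group $H$ is taken to be a group of permutations stored as tuples of images; both representations, and the names `normal_form`, `power`, and `extend`, are assumptions made for the example.

```python
def normal_form(word):
    """Reduce a sequence of (generator, exponent) pairs to the unique reduced form."""
    stack = []
    for gen, n in word:
        if n == 0:
            continue                         # a zeroth power is the identity
        if stack and stack[-1][0] == gen:
            _, m = stack.pop()
            if m + n != 0:                   # merge powers of the same generator
                stack.append((gen, m + n))
        else:
            stack.append((gen, n))
    return tuple(stack)


def compose(p, q):
    """Return the permutation p o q (apply q first)."""
    return tuple(p[q[j]] for j in range(len(q)))


def power(p, n):
    """Return the n-th power of the permutation p, for any integer n."""
    if n < 0:
        inv = [0] * len(p)
        for j, image in enumerate(p):
            inv[image] = j
        p, n = tuple(inv), -n
    result = tuple(range(len(p)))
    for _ in range(n):
        result = compose(result, p)
    return result


def extend(images, word):
    """h(word): replace each generator by its chosen image and multiply out."""
    size = len(next(iter(images.values())))
    result = tuple(range(size))
    for gen, n in word:
        result = compose(result, power(images[gen], n))
    return result
```

Choosing, say, `images = {"a": (1, 0, 2), "b": (1, 2, 0)}`, one can check on sample words `u` and `v` that `extend(images, normal_form(u + v))` equals the composite of `extend(images, u)` and `extend(images, v)`, as the homomorphism property requires.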


The results of the preceding section (in particular, Corollary 68.6) imply the fol- 
lowing: 


Theorem 69.2. Let G = Gi * G2, where G, and Gz are free groups with {aq )aes 
and {aa}acKk as respective systems of free generators. If J and K are disjoint, then G 
is a free group with (aq )aesuxK as a system of free generators. 


Definition. Let $\{a_\alpha\}_{\alpha \in J}$ be an arbitrary indexed family. Let $G_\alpha$ denote the set of all symbols of the form $a_\alpha^n$ for $n \in \mathbb{Z}$. We make $G_\alpha$ into a group by defining

$$a_\alpha^n \cdot a_\alpha^m = a_\alpha^{n+m}.$$

Then $a_\alpha^0$ is the identity element of $G_\alpha$, and $a_\alpha^{-n}$ is the inverse of $a_\alpha^n$. We denote $a_\alpha^1$ simply by $a_\alpha$. The external free product of the groups $\{G_\alpha\}$ is called the free group on the elements $a_\alpha$.


If G is the free group on the elements $a_\alpha$, we normally abuse notation and identify the elements of the group $G_\alpha$ with their images under the monomorphism $i_\alpha : G_\alpha \to G$ involved in the construction of the external free product. Then each $a_\alpha$ is treated as an element of G, and the family $\{a_\alpha\}$ forms a system of free generators for G.
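The reduced-word description of a free group can be sketched in code. The following is a minimal illustration, not part of the text's construction: a word is stored as a list of (generator, exponent) pairs, and the names `reduce_word`, `multiply`, and `inverse` are hypothetical helpers chosen for this sketch. Multiplication is concatenation followed by reduction, which merges adjacent powers of the same generator and discards zero exponents.

```python
from typing import List, Tuple

# A word is a list of (generator name, exponent) pairs, e.g. [("a", 2), ("b", -1)].
Word = List[Tuple[str, int]]


def reduce_word(word: Word) -> Word:
    """Return the reduced form of a word.

    Adjacent powers of the same generator are merged and zero
    exponents are dropped, so that in the result consecutive
    generators differ and every exponent is nonzero.
    """
    out: Word = []
    for gen, exp in word:
        if exp == 0:
            continue
        if out and out[-1][0] == gen:
            total = out[-1][1] + exp
            out.pop()
            if total != 0:
                out.append((gen, total))
        else:
            out.append((gen, exp))
    return out


def multiply(u: Word, v: Word) -> Word:
    """Product in the free group: concatenate, then reduce."""
    return reduce_word(u + v)


def inverse(u: Word) -> Word:
    """Inverse word: reverse the order and negate each exponent."""
    return [(gen, -exp) for gen, exp in reversed(u)]
```

For instance, multiplying the word a b^2 by b^{-2} a^3 cancels the middle factors and leaves a^4, and any word times its inverse reduces to the empty word, which represents the identity.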

There is an important connection between free groups and free abelian groups. In 
order to describe it, we must recall the notion of commutator subgroup from algebra. 


Definition. Let G be a group. If $x, y \in G$, we denote by [x, y] the element

$$[x, y] = xyx^{-1}y^{-1}$$

of G; it is called the commutator of x and y. The subgroup of G generated by the set
of all commutators in G is called the commutator subgroup of G and denoted [G, G].


The following result may be familiar; we provide a proof, for completeness: 


Lemma 69.3. Given G, the subgroup [G, G] is a normal subgroup of G and the quotient group G/[G, G] is abelian. If h : G → H is any homomorphism from G to an abelian group H, then the kernel of h contains [G, G], so h induces a homomorphism k : G/[G, G] → H.

Proof. Step 1. First we show that any conjugate of a commutator is in [G, G]. We compute as follows:

$$
\begin{aligned}
g[x, y]g^{-1} &= g(xyx^{-1}y^{-1})g^{-1} \\
&= (gxyx^{-1})(y^{-1}g^{-1}) \\
&= (gxyx^{-1})(g^{-1}y^{-1}yg)(y^{-1}g^{-1}) \\
&= ((gx)y(gx)^{-1}y^{-1})(ygy^{-1}g^{-1}) \\
&= [gx, y] \cdot [y, g],
\end{aligned}
$$

which is in [G, G], as desired.


Step 2. We show that [G, G] is a normal subgroup of G. Let z be an arbitrary element of [G, G]; we show that any conjugate $gzg^{-1}$ of z is also in [G, G]. The element z is a product of commutators and their inverses. Because

$$[x, y]^{-1} = (xyx^{-1}y^{-1})^{-1} = [y, x],$$

z actually equals a product of commutators. Let $z = z_1 \cdots z_n$, where each $z_i$ is a commutator. Then

$$gzg^{-1} = (gz_1g^{-1})(gz_2g^{-1}) \cdots (gz_ng^{-1}),$$

which is a product of elements of [G, G] by Step 1 and hence belongs to [G, G].

Step 3. We show that G/[G, G] is abelian. Let G′ = [G, G]; we wish to show that

$$(aG')(bG') = (bG')(aG'),$$

that is, abG′ = baG′. This is equivalent to the equation

$$a^{-1}b^{-1}abG' = G',$$

and this equation follows from the fact that $a^{-1}b^{-1}ab = [a^{-1}, b^{-1}]$, which is an element of G′.


Step 4. To complete the proof, we note that because H is abelian, h carries each
commutator to the identity element of H. Hence the kernel of h contains [G, G], so
that h induces the desired homomorphism k. ∎
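The statements of Lemma 69.3 can be checked by brute force in a small finite group. The sketch below is an illustration only: it takes G to be the symmetric group on three letters, with permutations stored as tuples, and the helper names (`compose`, `inverse`, `commutator`, `generated_subgroup`) are assumptions of this example. It computes all commutators, generates [G, G], and then tests normality and the criterion of Step 3.

```python
from itertools import permutations


def compose(p, q):
    """Product p*q of permutations, acting as (p*q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)


def inverse(p):
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def commutator(x, y):
    """[x, y] = x y x^{-1} y^{-1}."""
    return compose(compose(x, y), compose(inverse(x), inverse(y)))


def generated_subgroup(gens, identity):
    """Subgroup generated by gens in a finite group.

    In a finite group every inverse is a positive power, so closing
    under right multiplication by the generators is enough.
    """
    elems = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems


identity = (0, 1, 2)
G = list(permutations(range(3)))
commutators = {commutator(x, y) for x in G for y in G}
derived = generated_subgroup(commutators, identity)

# Normality: every conjugate g z g^{-1} of an element z of [G, G] lies in [G, G].
is_normal = all(
    compose(compose(g, z), inverse(g)) in derived for g in G for z in derived
)

# Step 3 criterion: a^{-1} b^{-1} a b lies in [G, G] for all a, b.
quotient_is_abelian = all(
    compose(compose(inverse(a), inverse(b)), compose(a, b)) in derived
    for a in G
    for b in G
)
```

In this example the commutator subgroup has three elements (the even permutations), so the quotient has order two and is abelian, consistent with the lemma.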


Theorem 69.4. If G is a free group with free generators $a_\alpha$, then G/[G, G] is a free abelian group with basis $[a_\alpha]$, where $[a_\alpha]$ denotes the coset of $a_\alpha$ in G/[G, G].

Proof. We apply Lemma 67.7. Given any family $\{y_\alpha\}$ of elements of the abelian group H, there exists a homomorphism h : G → H such that $h(a_\alpha) = y_\alpha$ for each $\alpha$. Because H is abelian, the kernel of h contains [G, G]; therefore h induces a homomorphism k : G/[G, G] → H that carries $[a_\alpha]$ to $y_\alpha$. ∎
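For a free group on finitely many generators, the projection onto G/[G, G] can be made concrete: identifying the free abelian group on n basis elements with n-tuples of integers, a word is sent to its vector of exponent sums. The sketch below assumes words are stored as lists of (generator, exponent) pairs; the function name `exponent_sums` is hypothetical.

```python
from collections import Counter


def exponent_sums(word, generators):
    """Image of a word under the map to the free abelian group.

    The word is a list of (generator, exponent) pairs; the result is
    the tuple of total exponents, one entry per generator, in the
    order given by `generators`.
    """
    totals = Counter()
    for gen, exp in word:
        totals[gen] += exp
    return tuple(totals[g] for g in generators)


gens = ["a", "b"]
commutator_word = [("a", 1), ("b", 1), ("a", -1), ("b", -1)]
u = [("a", 2), ("b", -1)]
v = [("b", 3), ("a", 1)]
```

Every commutator has exponent-sum vector zero, and the vector of a concatenation u + v is the sum of the vectors of u and v, which is what it means for this map to be a homomorphism into an abelian group whose kernel contains [G, G].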


Corollary 69.5. If G is a free group with n free generators, then any system of free
generators for G has n elements.


Proof. The free abelian group G/[G, G] has rank n. ∎

The properties of free groups are in many ways similar to those of free abelian
groups. For instance, if H is a subgroup of a free abelian group G, then H itself is 
a free abelian group. (The proof in the case where G has finite rank is outlined in 
Exercise 6 of §67; the proof in the general case is similar.) The analogous result holds 
for free groups, but the proof is considerably more difficult. We shall give a proof in 
Chapter 14 that is based on the theory of covering spaces. 

In other ways, free groups are very different from free abelian groups. Given a free 
abelian group of rank n, the rank of any subgroup is at most n; but the analogous result 
for free groups does not hold. If G is a free group with a system of n free generators, 
then the cardinality of a system of free generators for a subgroup of G may be greater 
than n; it may even be infinite! We shall explore this situation later. 


Generators and relations 


A basic problem in group theory is to determine, for two given groups, whether or not 
they are isomorphic. For free abelian groups, the problem is solved; two such groups 
are isomorphic if and only if they have bases with the same cardinality. Similarly, two 
free groups are isomorphic if and only if their systems of free generators have the same 
cardinality. (We have proved these facts in the case of finite cardinality.) 

For arbitrary groups, however, the answer is not so simple. Only in the case of an
abelian group that is finitely generated is there a clear-cut answer.

If G is abelian and finitely generated, then there is a fundamental theorem to the 
effect that G is the direct sum of two subgroups, G = H ⊕ T, where H is free abelian
of finite rank, and T is the subgroup of G consisting of all elements of finite order. (We 
call T the torsion subgroup of G.) The rank of H is uniquely determined by G, since 
it equals the rank of the quotient of G by its torsion subgroup. This number is often 
called the betti number of G. Furthermore, the subgroup T is itself a direct sum; it 
is the direct sum of a finite number of finite cyclic groups whose orders are powers of 
primes. The orders of these groups are uniquely determined by T (and hence by G), 
and are called the elementary divisors of G. Thus the isomorphism class of G is 
completely determined by specifying its betti number and its elementary divisors. 
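As a small worked illustration of elementary divisors, consider a finite abelian group given as a direct sum of cyclic groups of orders $n_1, \ldots, n_k$. The sketch below relies on the standard fact (not proved in this section) that a cyclic group of order n is the direct sum of cyclic groups whose orders are the prime-power factors of n. The function names are hypothetical.

```python
def prime_power_factors(n):
    """Prime-power factors of n, e.g. 12 -> [4, 3]."""
    factors = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            q = 1
            while n % p == 0:
                n //= p
                q *= p
            factors.append(q)
        p += 1
    if n > 1:
        factors.append(n)
    return factors


def elementary_divisors(cyclic_orders):
    """Elementary divisors of Z/n_1 + ... + Z/n_k, sorted increasingly."""
    divisors = []
    for n in cyclic_orders:
        divisors.extend(prime_power_factors(n))
    return sorted(divisors)
```

With this helper, the sum of cyclic groups of orders 12 and 18 has elementary divisors 2, 3, 4, 9. The groups with cyclic orders [6] and [2, 3] have the same elementary divisors and hence are isomorphic, whereas [4] and [2, 2] give different lists.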

If G is not abelian, matters are not nearly so satisfactory, even if G is finitely 
generated. What can we specify that will determine G? The best we can do is the 
following: 

Given G, suppose we are given a family $\{a_\alpha\}_{\alpha \in J}$ of generators for G. Let F be the free group on the elements $\{a_\alpha\}$. Then the obvious map $h(a_\alpha) = a_\alpha$ of these elements into G extends to a homomorphism h : F → G that is surjective. If N equals the kernel of h, then F/N ≅ G. So one way of specifying G is to give a family $\{a_\alpha\}$ of generators for G, and somehow to specify the subgroup N. Each element of N is called a relation on F, and N is called the relations subgroup. We can specify N by giving a set of generators for N. But since N is normal in F, we can also specify N by a smaller set. Specifically, we can specify N by giving a family $\{r_\beta\}$ of elements of F such that these elements and their conjugates generate N, that is, such that N is the least normal subgroup of F that contains the elements $r_\beta$. In this case, we call the family $\{r_\beta\}$ a complete set of relations for G.

Each element of N belongs to F, so it can of course be represented uniquely by a 
reduced word in powers of the generators $\{a_\alpha\}$. When we speak of a relation on the
generators of G, we sometimes refer to this reduced word, rather than to the element 
of N it represents. The context will make the meaning clear. 


Definition. If G is a group, a presentation of G consists of a family $\{a_\alpha\}$ of generators for G, along with a complete set $\{r_\beta\}$ of relations for G, where each $r_\beta$ is an element of the free group on the set $\{a_\alpha\}$. If the family $\{a_\alpha\}$ is finite, then G is finitely generated, of course. If both the families $\{a_\alpha\}$ and $\{r_\beta\}$ are finite, then G is said to be finitely presented, and these families form what is called a finite presentation for G.
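One practical use of generators and relations is to build homomorphisms out of a presented group. By the extension condition of Lemma 69.1, any assignment of the generators to elements of a group H extends to the free group F; if every relation is carried to the identity, the extension has N in its kernel and so induces a homomorphism of F/N into H. The sketch below checks this for a hypothetical presentation with generators a, b and relations a^3, b^2, abab, mapping into permutations of three letters. The presentation, the chosen images, and all helper names are assumptions of this example; it does not claim that the induced map is an isomorphism.

```python
def compose(p, q):
    """Product p*q of permutations, acting as (p*q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)


def inverse(p):
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def power(p, n):
    """n-th power of p, for any integer n."""
    base = p if n >= 0 else inverse(p)
    result = tuple(range(len(p)))
    for _ in range(abs(n)):
        result = compose(result, base)
    return result


def evaluate(word, images):
    """Image of a word (list of (generator, exponent) pairs) in the target group."""
    size = len(next(iter(images.values())))
    result = tuple(range(size))
    for gen, exp in word:
        result = compose(result, power(images[gen], exp))
    return result


identity = (0, 1, 2)
images = {"a": (1, 2, 0), "b": (1, 0, 2)}  # a 3-cycle and a transposition
relators = [
    [("a", 3)],
    [("b", 2)],
    [("a", 1), ("b", 1), ("a", 1), ("b", 1)],
]
relators_hold = all(evaluate(r, images) == identity for r in relators)

# The commutator of a and b is not sent to the identity, so the presented
# group has a non-abelian image and therefore is not abelian itself.
commutator_image = evaluate([("a", 1), ("b", 1), ("a", -1), ("b", -1)], images)
```

The check `relators_hold` only confirms that the assignment respects the relations; deciding whether two presentations give isomorphic groups is, in general, not something a computation of this kind can settle.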


This procedure for specifying G is far from satisfactory. A presentation for G does 
determine G uniquely, up to isomorphism; but two completely different presentations 
can lead to groups that are isomorphic. Furthermore, even in the finite case there is no 
effective procedure for determining, from two different presentations, whether or not 
the groups they determine are isomorphic. This result is known as the “unsolvability 
of the isomorphism problem” for groups. 

Unsatisfactory as it is, this is the best we can do! 


Exercises 


1. If $G = G_1 * G_2$, show that

$$G/[G, G] \cong (G_1/[G_1, G_1]) \oplus (G_2/[G_2, G_2]).$$

[Hint: Use the extension condition for direct sums and free products to define homomorphisms

$$G/[G, G] \rightleftarrows (G_1/[G_1, G_1]) \oplus (G_2/[G_2, G_2])$$

that are inverse to each other.]

2. Generalize the result of Exercise 1 to arbitrary free products.

3. Prove the following: 
Theorem. Let $G = G_1 * G_2$, where $G_1$ and $G_2$ are cyclic of orders m and n,
respectively. Then m and n are uniquely determined by G.
Proof. 
(a) Show G/[G, G] has order mn. 
(b) Determine the largest integer k such that G has an element of order k. (See
Exercise 2 of §68.)

(c) Prove the theorem. 

4. Show that if $G = G_1 \oplus G_2$, where $G_1$ and $G_2$ are cyclic of orders m and n,
respectively, then m and n are not uniquely determined by G in general. [Hint:
If m and n are relatively prime, show that G is cyclic of order mn.]

§70 The Seifert-van Kampen Theorem


We now return to the problem of determining the fundamental group of a space X that is written as the union of two open subsets U and V having path-connected intersection. We showed in §59 that, if $x_0 \in U \cap V$, the images of the two groups $\pi_1(U, x_0)$ and $\pi_1(V, x_0)$ in $\pi_1(X, x_0)$, under the homomorphisms induced by inclusion, generate the latter group. In this section, we show that $\pi_1(X, x_0)$ is, in fact, completely determined by these two groups, the group $\pi_1(U \cap V, x_0)$, and the various homomorphisms of these groups induced by inclusion. This is a basic result about fundamental groups. It will enable us to compute the fundamental groups of a number of spaces, including the compact 2-manifolds.


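Theorem 70.1 (Seifert-van Kampen theorem). Let X = U ∪ V, where U and V are open in X; assume U, V, and U ∩ V are path connected; let $x_0 \in U \cap V$. Let H be a group, and let

$$\phi_1 : \pi_1(U, x_0) \to H \quad\text{and}\quad \phi_2 : \pi_1(V, x_0) \to H$$

be homomorphisms. Let $i_1$, $i_2$, $j_1$, $j_2$ be the homomorphisms indicated in the following diagram, each induced by inclusion.

$$
\begin{array}{ccccc}
 & & \pi_1(U, x_0) & & \\
 & \overset{i_1}{\nearrow} & \downarrow{\scriptstyle j_1} & \overset{\phi_1}{\searrow} & \\
\pi_1(U \cap V, x_0) & \longrightarrow & \pi_1(X, x_0) & \overset{\Phi}{\dashrightarrow} & H \\
 & \underset{i_2}{\searrow} & \uparrow{\scriptstyle j_2} & \underset{\phi_2}{\nearrow} & \\
 & & \pi_1(V, x_0) & &
\end{array}
$$

If $\phi_1 \circ i_1 = \phi_2 \circ i_2$, then there is a unique homomorphism $\Phi : \pi_1(X, x_0) \to H$ such that $\Phi \circ j_1 = \phi_1$ and $\Phi \circ j_2 = \phi_2$.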

This theorem says that if $\phi_1$ and $\phi_2$ are arbitrary homomorphisms that are "compatible on U ∩ V," then they induce a homomorphism of $\pi_1(X, x_0)$ into H.

Proof. Uniqueness is easy. Theorem 59.1 tells us that $\pi_1(X, x_0)$ is generated by the images of $j_1$ and $j_2$. The value of $\Phi$ on the generator $j_1(g_1)$ must equal $\phi_1(g_1)$, and its value on $j_2(g_2)$ must equal $\phi_2(g_2)$. Hence $\Phi$ is completely determined by $\phi_1$ and $\phi_2$. To show $\Phi$ exists is another matter!

For convenience, we introduce the following notation: Given a path f in X, we shall use [f] to denote its path-homotopy class in X. If f happens to lie in U, then $[f]_U$ is used to denote its path-homotopy class in U. The notations $[f]_V$ and $[f]_{U \cap V}$ are defined similarly.

Step 1. We begin by defining a set map $\rho$ that assigns, to each loop f based at $x_0$ that lies in U or in V, an element of the group H. We define

$$
\begin{aligned}
\rho(f) &= \phi_1([f]_U) \quad \text{if } f \text{ lies in } U, \\
\rho(f) &= \phi_2([f]_V) \quad \text{if } f \text{ lies in } V.
\end{aligned}
$$

Then $\rho$ is well-defined, for if f lies in both U and V,

$$\phi_1([f]_U) = \phi_1 i_1([f]_{U \cap V}) \quad\text{and}\quad \phi_2([f]_V) = \phi_2 i_2([f]_{U \cap V}),$$

and these two elements of H are equal by hypothesis. The set map $\rho$ satisfies the following conditions:

(1) If $[f]_U = [g]_U$, or if $[f]_V = [g]_V$, then $\rho(f) = \rho(g)$.
(2) If both f and g lie in U, or if both lie in V, then $\rho(f * g) = \rho(f) \cdot \rho(g)$.

The first holds by definition, and the second holds because $\phi_1$ and $\phi_2$ are homomorphisms.


Step 2. We now extend $\rho$ to a set map $\sigma$ that assigns, to each path f lying in U or V, an element of H, such that the map $\sigma$ satisfies condition (1) of Step 1, and satisfies (2) when f * g is defined.

To begin, we choose, for each x in X, a path $\alpha_x$ from $x_0$ to x, as follows: If $x = x_0$, let $\alpha_x$ be the constant path at $x_0$. If $x \in U \cap V$, let $\alpha_x$ be a path in U ∩ V. And if x is in U or V but not in U ∩ V, let $\alpha_x$ be a path in U or V, respectively.

Then, for any path f in U or in V, we define a loop L(f) in U or V, respectively, based at $x_0$, by the equation

$$L(f) = \alpha_x * (f * \bar{\alpha}_y),$$

where x is the initial point of f and y is the final point of f. See Figure 70.1. Finally, we define

$$\sigma(f) = \rho(L(f)).$$


Figure 70.1

First, we show that $\sigma$ is an extension of $\rho$. If f is a loop based at $x_0$ lying in either U or V, then

$$L(f) = \alpha_{x_0} * (f * \bar{\alpha}_{x_0})$$

because $\alpha_{x_0}$ is the constant path at $x_0$. Then L(f) is path homotopic to f in either U or V, so that $\rho(L(f)) = \rho(f)$ by condition (1) for $\rho$. Hence $\sigma(f) = \rho(f)$.

To check condition (1), let f and g be paths that are path homotopic in U or in V. Then the loops L(f) and L(g) are also path homotopic either in U or in V, so condition (1) for $\rho$ applies. To check (2), let f and g be arbitrary paths in U or in V such that f(1) = g(0). We have

$$L(f) * L(g) = (\alpha_x * (f * \bar{\alpha}_y)) * (\alpha_y * (g * \bar{\alpha}_z))$$

for appropriate points x, y, and z; this loop is path homotopic in U or V to L(f * g). Then

$$\rho(L(f * g)) = \rho(L(f) * L(g)) = \rho(L(f)) \cdot \rho(L(g))$$

by conditions (1) and (2) for $\rho$. Hence $\sigma(f * g) = \sigma(f) \cdot \sigma(g)$.


Step 3. Finally, we extend $\sigma$ to a set map $\tau$ that assigns, to an arbitrary path f of X, an element of H. It will satisfy the following conditions:

(1) If [f] = [g], then $\tau(f) = \tau(g)$.
(2) $\tau(f * g) = \tau(f) \cdot \tau(g)$ if f * g is defined.

Given f, choose a subdivision $s_0 < \cdots < s_n$ of [0, 1] such that f maps each of the subintervals $[s_{i-1}, s_i]$ into U or V. Let $f_i$ denote the positive linear map of [0, 1] onto $[s_{i-1}, s_i]$, followed by f. Then $f_i$ is a path in U or in V, and

$$[f] = [f_1] * \cdots * [f_n].$$

If $\tau$ is to be an extension of $\sigma$ and satisfy (1) and (2), we must have

$$(*) \qquad \tau(f) = \sigma(f_1) \cdot \sigma(f_2) \cdots \sigma(f_n).$$

So we shall use this equation as our definition of $\tau$.

We show that this definition is independent of the choice of subdivision. It suffices to show that the value of $\tau(f)$ remains unchanged if we adjoin a single additional point p to the subdivision. Let i be the index such that $s_{i-1} < p < s_i$. If we compute $\tau(f)$ using this new subdivision, the only change in formula (*) is that the factor $\sigma(f_i)$ disappears and is replaced by the product $\sigma(f_i') \cdot \sigma(f_i'')$, where $f_i'$ and $f_i''$ equal the positive linear maps of [0, 1] to $[s_{i-1}, p]$ and to $[p, s_i]$, respectively, followed by f. But $f_i$ is path homotopic to $f_i' * f_i''$ in U or V, so that $\sigma(f_i) = \sigma(f_i') \cdot \sigma(f_i'')$, by conditions (1) and (2) for $\sigma$. Thus $\tau$ is well-defined.

It follows that $\tau$ is an extension of $\sigma$. For if f already lies in U or V, we can use the trivial partition of [0, 1] to define $\tau(f)$; then $\tau(f) = \sigma(f)$ by definition.

Step 4. We prove condition (1) for the set map $\tau$. This part of the proof requires some care.

We first verify this condition in a special case. Let f and g be paths in X from x to y, say, and let F be a path homotopy between them. Let us assume the additional hypothesis that there exists a subdivision $s_0, \ldots, s_n$ of [0, 1] such that F carries each rectangle $R_i = [s_{i-1}, s_i] \times I$ into either U or V. We show in this case that $\tau(f) = \tau(g)$.

Given i, consider the positive linear map of [0, 1] onto $[s_{i-1}, s_i]$ followed by f or by g; and call these two paths $f_i$ and $g_i$, respectively. The restriction of F to the rectangle $R_i$ gives us a homotopy between $f_i$ and $g_i$ that takes place in either U or V, but it is not a path homotopy because the end points of the paths may move during the homotopy. Let us consider the paths traced out by these end points during the homotopy. We define $\beta_i$ to be the path $\beta_i(t) = F(s_i, t)$. Then $\beta_i$ is a path in X from $f(s_i)$ to $g(s_i)$. The paths $\beta_0$ and $\beta_n$ are the constant paths at x and y, respectively. See Figure 70.2. We show that for each i,

$$f_i * \beta_i \simeq_p \beta_{i-1} * g_i,$$

with the path homotopy taking place in U or in V.


Figure 70.2 


In the rectangle $R_i$, take the broken-line path that runs along the bottom and right edges of $R_i$, from $s_{i-1} \times 0$ to $s_i \times 0$ to $s_i \times 1$; if we follow this path by the map F, we obtain the path $f_i * \beta_i$. Similarly, if we take the broken-line path along the left and top edges of $R_i$ and follow it by F, we obtain the path $\beta_{i-1} * g_i$. Because $R_i$ is convex, there is a path homotopy in $R_i$ between these two broken-line paths; if we follow by F, we obtain a path homotopy between $f_i * \beta_i$ and $\beta_{i-1} * g_i$ that takes place in either U or V, as desired.

It follows from conditions (1) and (2) for $\sigma$ that

$$\sigma(f_i) \cdot \sigma(\beta_i) = \sigma(\beta_{i-1}) \cdot \sigma(g_i),$$

so that

$$(**) \qquad \sigma(f_i) = \sigma(\beta_{i-1}) \cdot \sigma(g_i) \cdot \sigma(\beta_i)^{-1}.$$

It follows similarly that since $\beta_0$ and $\beta_n$ are constant paths, $\sigma(\beta_0) = \sigma(\beta_n) = 1$. (For the fact that $\beta_0 * \beta_0 = \beta_0$ implies that $\sigma(\beta_0) \cdot \sigma(\beta_0) = \sigma(\beta_0)$.)

We now compute as follows:

$$\tau(f) = \sigma(f_1) \cdot \sigma(f_2) \cdots \sigma(f_n).$$

Substituting (**) in this equation and simplifying, we have the equation

$$\tau(f) = \sigma(g_1) \cdot \sigma(g_2) \cdots \sigma(g_n) = \tau(g).$$

Thus, we have proved condition (1) in our special case.

Now we prove condition (1) in the general case. Given f and g and a path homotopy F between them, let us choose subdivisions $s_0, \ldots, s_n$ and $t_0, \ldots, t_m$ of [0, 1] such that F maps each subrectangle $[s_{i-1}, s_i] \times [t_{j-1}, t_j]$ into either U or V. Let $f_j$ be the path $f_j(s) = F(s, t_j)$; then $f_0 = f$ and $f_m = g$. The pair of paths $f_{j-1}$ and $f_j$ satisfy the requirements of our special case, so that $\tau(f_{j-1}) = \tau(f_j)$ for each j. It follows that $\tau(f) = \tau(g)$, as desired.

Step 5. Now we prove condition (2) for the set map $\tau$. Given a path f * g in X, let us choose a subdivision $s_0 < \cdots < s_n$ of [0, 1] containing the point 1/2 as a subdivision point, such that f * g carries each subinterval into either U or V. Let k be the index such that $s_k = 1/2$.

For i = 1, ..., k, the positive linear map of [0, 1] to $[s_{i-1}, s_i]$, followed by f * g, is the same as the positive linear map of [0, 1] to $[2s_{i-1}, 2s_i]$ followed by f; call this map $f_i$. Similarly, for i = k + 1, ..., n, the positive linear map of [0, 1] to $[s_{i-1}, s_i]$, followed by f * g, is the same as the positive linear map of [0, 1] to $[2s_{i-1} - 1, 2s_i - 1]$ followed by g; call this map $g_{i-k}$. Using the subdivision $s_0, \ldots, s_n$ for the domain of the path f * g, we have

$$\tau(f * g) = \sigma(f_1) \cdots \sigma(f_k) \cdot \sigma(g_1) \cdots \sigma(g_{n-k}).$$

Using the subdivision $2s_0, \ldots, 2s_k$ for the path f, we have

$$\tau(f) = \sigma(f_1) \cdots \sigma(f_k).$$

And using the subdivision $2s_k - 1, \ldots, 2s_n - 1$ for the path g, we have

$$\tau(g) = \sigma(g_1) \cdots \sigma(g_{n-k}).$$

Thus (2) holds trivially.
Step 6. The theorem follows. For each loop f in X based at $x_0$, we define

$$\Phi([f]) = \tau(f).$$

Conditions (1) and (2) show that $\Phi$ is a well-defined homomorphism.

Let us show that $\Phi \circ j_1 = \phi_1$. If f is a loop in U, then

$$\Phi(j_1([f]_U)) = \Phi([f]) = \tau(f) = \rho(f) = \phi_1([f]_U),$$

as desired. The proof that $\Phi \circ j_2 = \phi_2$ is similar. ∎

The preceding theorem is the modern formulation of the Seifert-van Kampen theorem. We now turn to the classical version, which involves the free product of two groups. Recall that if G is the external free product $G = G_1 * G_2$, we often treat $G_1$ and $G_2$ as if they were subgroups of G, for simplicity of notation.


Theorem 70.2 (Seifert-van Kampen theorem, classical version). Assume the hypotheses of the preceding theorem. Let

$$j : \pi_1(U, x_0) * \pi_1(V, x_0) \to \pi_1(X, x_0)$$

be the homomorphism of the free product that extends the homomorphisms $j_1$ and $j_2$ induced by inclusion. Then j is surjective, and its kernel is the least normal subgroup N of the free product that contains all elements represented by words of the form

$$(i_1(g)^{-1}, i_2(g)),$$

for $g \in \pi_1(U \cap V, x_0)$.


Said differently, the kernel of j is generated by all elements of the free product of the form $i_1(g)^{-1} i_2(g)$, and their conjugates.
Proof. The fact that $\pi_1(X, x_0)$ is generated by the images of $j_1$ and $j_2$ implies that j is surjective.

We show that $N \subset \ker j$. Since ker j is normal, it is enough to show that $i_1(g)^{-1} i_2(g)$ belongs to ker j for each $g \in \pi_1(U \cap V, x_0)$. If i : U ∩ V → X is the inclusion mapping, then

$$j i_1(g) = j_1 i_1(g) = i_*(g) = j_2 i_2(g) = j i_2(g).$$

Then $i_1(g)^{-1} i_2(g)$ belongs to the kernel of j.

It follows that j induces an epimorphism

$$k : \pi_1(U, x_0) * \pi_1(V, x_0)/N \to \pi_1(X, x_0).$$

We show that N equals ker j by showing that k is injective. It suffices to show that k has a left inverse.

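Let H denote the group $\pi_1(U, x_0) * \pi_1(V, x_0)/N$. Let $\phi_1 : \pi_1(U, x_0) \to H$ equal the inclusion of $\pi_1(U, x_0)$ into the free product followed by projection of the free product onto its quotient by N. Let $\phi_2 : \pi_1(V, x_0) \to H$ be defined similarly. Consider the diagram

$$
\begin{array}{ccccc}
 & & \pi_1(U, x_0) & & \\
 & \overset{i_1}{\nearrow} & \downarrow{\scriptstyle j_1} & \overset{\phi_1}{\searrow} & \\
\pi_1(U \cap V, x_0) & \longrightarrow & \pi_1(X, x_0) & \underset{k}{\overset{\Phi}{\rightleftarrows}} & H \\
 & \underset{i_2}{\searrow} & \uparrow{\scriptstyle j_2} & \underset{\phi_2}{\nearrow} & \\
 & & \pi_1(V, x_0) & &
\end{array}
$$

It is easy to see that $\phi_1 \circ i_1 = \phi_2 \circ i_2$. For if $g \in \pi_1(U \cap V, x_0)$, then $\phi_1(i_1(g))$ is the coset $i_1(g)N$ in H, and $\phi_2(i_2(g))$ is the coset $i_2(g)N$. Because $i_1(g)^{-1} i_2(g) \in N$, these cosets are equal.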

It follows from Theorem 70.1 that there is a homomorphism $\Phi : \pi_1(X, x_0) \to H$ such that $\Phi \circ j_1 = \phi_1$ and $\Phi \circ j_2 = \phi_2$. We show that $\Phi$ is a left inverse for k. It suffices to show that $\Phi \circ k$ acts as the identity on any generator of H, that is, on any coset of the form gN, where g is in $\pi_1(U, x_0)$ or $\pi_1(V, x_0)$. But if $g \in \pi_1(U, x_0)$, we have

$$k(gN) = j(g) = j_1(g),$$

so that

$$\Phi(k(gN)) = \Phi(j_1(g)) = \phi_1(g) = gN,$$

as desired. A similar remark applies if $g \in \pi_1(V, x_0)$. ∎
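When presentations of the groups involved are available, the classical version of the theorem can be turned into a recipe for writing down a presentation of $\pi_1(X, x_0)$. The sketch below is an illustration under stated assumptions, not a construction from the text: it assumes presentations of $\pi_1(U, x_0)$ and $\pi_1(V, x_0)$ with disjoint generator names, and a finite list of generators of $\pi_1(U \cap V, x_0)$ whose images under $i_1$ and $i_2$ are given as words. It also uses the facts stated in Exercise 3 of this section. Words are lists of (generator, exponent) pairs, and the function names are hypothetical.

```python
def invert(word):
    """Inverse of a word: reverse the order and negate each exponent."""
    return [(gen, -exp) for gen, exp in reversed(word)]


def van_kampen_presentation(pres_u, pres_v, intersection_images):
    """Assemble generators and relators for pi_1(X) from data on U, V, and U ∩ V.

    pres_u, pres_v: pairs (generators, relators) for pi_1(U) and pi_1(V).
    intersection_images: list of pairs (i1_word, i2_word), one pair for each
        chosen generator g of pi_1(U ∩ V), giving i_1(g) as a word in the
        generators of pi_1(U) and i_2(g) as a word in those of pi_1(V).
    """
    gens_u, rels_u = pres_u
    gens_v, rels_v = pres_v
    if set(gens_u) & set(gens_v):
        raise ValueError("generator names for U and V must be disjoint")
    generators = list(gens_u) + list(gens_v)
    relators = list(rels_u) + list(rels_v)
    for i1_word, i2_word in intersection_images:
        # The relator i_1(g)^{-1} i_2(g) from Theorem 70.2 (left unreduced).
        relators.append(invert(i1_word) + i2_word)
    return generators, relators


# Simply connected intersection, two infinite cyclic pieces: no relators arise.
theta = van_kampen_presentation((["u"], []), (["v"], []), [])

# Hypothetical input: pi_1(U) free on a, b; V simply connected; the single
# generator of pi_1(U ∩ V) maps to a b a^{-1} b^{-1} in pi_1(U).
example = van_kampen_presentation(
    (["a", "b"], []),
    ([], []),
    [([("a", 1), ("b", 1), ("a", -1), ("b", -1)], [])],
)
```

In the first call the result is a free group on two generators. In the second, the single relator is the inverse of the commutator, so the resulting group is the quotient of the free group on a, b by the least normal subgroup containing the image of $i_1$.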


Corollary 70.3. Assume the hypotheses of the Seifert-van Kampen theorem. If U ∩ V is simply connected, then there is an isomorphism

$$k : \pi_1(U, x_0) * \pi_1(V, x_0) \to \pi_1(X, x_0).$$


Corollary 70.4. Assume the hypotheses of the Seifert-van Kampen theorem. If V is simply connected, there is an isomorphism

$$k : \pi_1(U, x_0)/N \to \pi_1(X, x_0),$$

where N is the least normal subgroup of $\pi_1(U, x_0)$ containing the image of the homomorphism

$$i_1 : \pi_1(U \cap V, x_0) \to \pi_1(U, x_0).$$


EXAMPLE 1. Let X be a theta-space. Then X is a Hausdorff space that is the union of three arcs A, B, and C, each pair of which intersect precisely in their end points p and q. We showed earlier that the fundamental group of X is not abelian. We show here that this group is in fact a free group on two generators.

Let a be an interior point of A and let b be an interior point of B. Write X as the union of the open sets U = X − a and V = X − b. See Figure 70.3. The space U ∩ V = X − a − b is simply connected because it is contractible. Furthermore, U and V have infinite cyclic fundamental groups, because U has the homotopy type of B ∪ C and V has the homotopy type of A ∪ C. Therefore, the fundamental group of X is the free product of two infinite cyclic groups, that is, it is a free group on two generators.


Figure 70.3 


Exercises 


In the following exercises, assume the hypotheses of the Seifert-van Kampen theorem.

1. Suppose that the homomorphism $i_*$ induced by inclusion i : U ∩ V → X is trivial.
(a) Show that $j_1$ and $j_2$ induce an epimorphism

$$h : (\pi_1(U, x_0)/N_1) * (\pi_1(V, x_0)/N_2) \to \pi_1(X, x_0),$$

where $N_1$ is the least normal subgroup of $\pi_1(U, x_0)$ containing image $i_1$, and $N_2$ is the least normal subgroup of $\pi_1(V, x_0)$ containing image $i_2$.
(b) Show that h is an isomorphism. [Hint: Use Theorem 70.1 to define a left inverse for h.]

2. Suppose that $i_2$ is surjective.
(a) Show that $j_1$ induces an epimorphism

$$h : \pi_1(U, x_0)/M \to \pi_1(X, x_0),$$

where M is the least normal subgroup of $\pi_1(U, x_0)$ containing $i_1(\ker i_2)$. [Hint: Show $j_1$ is surjective.]
(b) Show that h is an isomorphism. [Hint: Let $H = \pi_1(U, x_0)/M$. Let $\phi_1 : \pi_1(U, x_0) \to H$ be the projection. Use the fact that $\pi_1(U \cap V, x_0)/\ker i_2$ is isomorphic to $\pi_1(V, x_0)$ to define a homomorphism $\phi_2 : \pi_1(V, x_0) \to H$. Use Theorem 70.1 to define a left inverse for h.]

3. (a) Show that if $G_1$ and $G_2$ have finite presentations, so does $G_1 * G_2$.
(b) Show that if $\pi_1(U \cap V, x_0)$ is finitely generated and $\pi_1(U, x_0)$ and $\pi_1(V, x_0)$ have finite presentations, then $\pi_1(X, x_0)$ has a finite presentation. [Hint: If N′ is a normal subgroup of $\pi_1(U, x_0) * \pi_1(V, x_0)$ that contains the elements $i_1(g_i)^{-1} i_2(g_i)$ where $g_i$ runs over a set of generators for $\pi_1(U \cap V, x_0)$, then N′ contains $i_1(g)^{-1} i_2(g)$ for arbitrary g.]


§71 The Fundamental Group of a Wedge of Circles


In this section, we define what we mean by a wedge of circles, and we compute its 
fundamental group. 


Definition. Let X be a Hausdorff space that is the union of the subspaces $S_1, \ldots, S_n$, each of which is homeomorphic to the unit circle $S^1$. Assume that there is a point p of X such that $S_i \cap S_j = \{p\}$ whenever $i \neq j$. Then X is called the wedge of the circles $S_1, \ldots, S_n$.


Note that each space $S_i$, being compact, is closed in X. Note also that X can be imbedded in the plane; if $C_i$ denotes the circle of radius i in $\mathbb{R}^2$ with center at (i, 0), then X is homeomorphic to $C_1 \cup \cdots \cup C_n$.


Theorem 71.1. Let X be the wedge of the circles $S_1, \ldots, S_n$; let p be the common point of these circles. Then $\pi_1(X, p)$ is a free group. If $f_i$ is a loop in $S_i$ that represents a generator of $\pi_1(S_i, p)$, then the loops $f_1, \ldots, f_n$ represent a system of free generators for $\pi_1(X, p)$.


Proof. The result is immediate if n = 1. We proceed by induction on n. The proof is similar to the one given in Example 1 of the preceding section.

Let X be the wedge of the circles $S_1, \ldots, S_n$, with p the common point of these circles. Choose a point $q_i$ of $S_i$ different from p, for each i. Set $W_i = S_i - q_i$, and let

$$U = S_1 \cup W_2 \cup \cdots \cup W_n \quad\text{and}\quad V = W_1 \cup S_2 \cup \cdots \cup S_n.$$

Then $U \cap V = W_1 \cup \cdots \cup W_n$. See Figure 71.1. Each of the spaces U, V, and U ∩ V is path connected, being the union of path-connected spaces having a point in common.


Figure 71.1


The space $W_i$ is homeomorphic to an open interval, so it has the point p as a deformation retract; let $F_i : W_i \times I \to W_i$ be the deformation retraction. The maps $F_i$ fit together to define a map F : (U ∩ V) × I → U ∩ V that is a deformation retraction of U ∩ V onto p. (To show that F is continuous, we note that because $S_i$ is a closed subspace of X, the space $W_i = S_i - q_i$ is a closed subspace of U ∩ V, so that $W_i \times I$ is a closed subspace of (U ∩ V) × I. Then the pasting lemma applies.) It follows that U ∩ V is simply connected, so that $\pi_1(X, p)$ is the free product of the groups $\pi_1(U, p)$ and $\pi_1(V, p)$, relative to the monomorphisms induced by inclusion.

A similar argument shows that $S_1$ is a deformation retract of U and $S_2 \cup \cdots \cup S_n$ is a deformation retract of V. It follows that $\pi_1(U, p)$ is infinite cyclic, and the loop $f_1$ represents a generator. It also follows, using the induction hypothesis, that $\pi_1(V, p)$ is a free group, with the loops $f_2, \ldots, f_n$ representing a system of free generators. Our theorem now follows from Theorem 69.2. ∎


We generalize this result to a space X that is the union of infinitely many circles 
having a point in common. Here we must be careful about the topology of X. 


Definition. Let X be a space that is the union of the subspaces $X_\alpha$, for $\alpha \in J$. The topology of X is said to be coherent with the subspaces $X_\alpha$ provided a subset C of X is closed in X if $C \cap X_\alpha$ is closed in $X_\alpha$ for each $\alpha$. An equivalent condition is that a set be open in X if its intersection with each $X_\alpha$ is open in $X_\alpha$.


If X is the union of finitely many closed subspaces $X_1, \ldots, X_n$, then the topology of X is automatically coherent with these subspaces, since if $C \cap X_i$ is closed in $X_i$, it is closed in X, and C is the finite union of the sets $C \cap X_i$.


Definition. Let X be a space that is the union of the subspaces $S_\alpha$, for $\alpha \in J$, each of which is homeomorphic to the unit circle. Assume there is a point p of X such that $S_\alpha \cap S_\beta = \{p\}$ whenever $\alpha \neq \beta$. If the topology of X is coherent with the subspaces $S_\alpha$, then X is called the wedge of the circles $S_\alpha$.


In the finite case, the definition involved the Hausdorff condition instead of the 
coherence condition; in that case the coherence condition followed. In the infinite 
case, this would no longer be true, so we included the coherence condition as part of 
the definition. We would include the Hausdorff condition as well, but that is no longer 
necessary, for it follows from the coherence condition: 


Lemma 71.2. Let X be the wedge of the circles $S_\alpha$, for $\alpha \in J$. Then X is normal. Furthermore, any compact subspace of X is contained in the union of finitely many circles $S_\alpha$.


Proof. It is clear that one-point sets are closed in X. Let A and B be disjoint closed subsets of X; assume that B does not contain p. Choose disjoint subsets $U_\alpha$ and $V_\alpha$ of $S_\alpha$ that are open in $S_\alpha$ and contain $\{p\} \cup (A \cap S_\alpha)$ and $B \cap S_\alpha$, respectively. Let $U = \bigcup U_\alpha$ and $V = \bigcup V_\alpha$; then U and V are disjoint. Now $U \cap S_\alpha = U_\alpha$ because all the sets $U_\alpha$ contain p, and $V \cap S_\alpha = V_\alpha$ because no set $V_\alpha$ contains p. Hence U and V are open in X, as desired. Thus X is normal.

Now let C be a compact subspace of X. For each $\alpha$ for which it is possible, choose a point $x_\alpha$ of $C \cap (S_\alpha - p)$. The set $D = \{x_\alpha\}$ is closed in X, because its intersection with each space $S_\alpha$ is a one-point set or is empty. For the same reason, each subset of D is closed in X. Thus D is a closed discrete subspace of X contained in C; since C is limit point compact, D must be finite. ∎


Theorem 71.3. Let X be the wedge of the circles $S_\alpha$, for $\alpha \in J$; let p be the common point of these circles. Then $\pi_1(X, p)$ is a free group. If $f_\alpha$ is a loop in $S_\alpha$ representing a generator of $\pi_1(S_\alpha, p)$, then the loops $\{f_\alpha\}$ represent a system of free generators for $\pi_1(X, p)$.


Proof. Let $i_\alpha : \pi_1(S_\alpha, p) \to \pi_1(X, p)$ be the homomorphism induced by inclusion; let $G_\alpha$ be the image of $i_\alpha$.

Note that if f is any loop in X based at p, then the image set of f is compact, so that f lies in some finite union of subspaces $S_\alpha$. Furthermore, if f and g are two loops that are path homotopic in X, then they are actually path homotopic in some finite union of the subspaces $S_\alpha$.

It follows that the groups $\{G_\alpha\}$ generate $\pi_1(X, p)$. For if f is a loop in X, then f lies in $S_{\alpha_1} \cup \cdots \cup S_{\alpha_n}$ for some finite set of indices; then Theorem 71.1 implies that [f] is a product of elements of the groups $G_{\alpha_1}, \ldots, G_{\alpha_n}$. Similarly, it follows that $i_\beta$ is a monomorphism. For if f is a loop in $S_\beta$ that is path homotopic in X to a constant, then f is path homotopic to a constant in some finite union of spaces $S_\alpha$, so that Theorem 71.1 implies that f is path homotopic to a constant in $S_\beta$.

Finally, suppose there is a reduced nonempty word

$$w = (g_{\alpha_1}, \ldots, g_{\alpha_n})$$

in the elements of the groups $G_\alpha$ that represents the identity element of $\pi_1(X, p)$. Let f be a loop in X whose path-homotopy class is represented by w. Then f is path homotopic to a constant in X, so it is path homotopic to a constant in some finite union of subspaces $S_\alpha$. This contradicts Theorem 71.1. ∎


The preceding theorem depended on the fact that the topology of X was coherent
with the subspaces $S_\alpha$. Consider the following example:


EXAMPLE 1. Let $C_n$ be the circle of radius $1/n$ in $\mathbb{R}^2$ with center at the point $(1/n, 0)$.
Let X be the subspace of $\mathbb{R}^2$ that is the union of these circles; then X is the union of a
countably infinite collection of circles, each pair of which intersect in the origin p. However, X
is not the wedge of the circles $C_n$; we call X (for convenience) the infinite earring.

One can verify directly that X does not have the topology coherent with the
subspaces $C_n$; the intersection of the positive x-axis with X contains exactly one point from
each circle $C_n$, but it is not closed in X. Alternatively, for each n, let $f_n$ be a loop in $C_n$ that
represents a generator of $\pi_1(C_n, p)$; we show that $\pi_1(X, p)$ is not a free group with $\{[f_n]\}$
as a system of free generators. Indeed, we show the elements $[f_n]$ do not even generate the
group $\pi_1(X, p)$.

Consider the loop g in X defined as follows: For each n, define g on the interval
$[1/(n + 1), 1/n]$ to be the positive linear map of this interval onto $[0, 1]$ followed by $f_n$.
This specifies g on $(0, 1]$; define $g(0) = p$. Because X has the subspace topology derived
from $\mathbb{R}^2$, it is easy to see that g is continuous. See Figure 71.2. We show that given n, the
element [g] does not belong to the subgroup $G_n$ of $\pi_1(X, p)$ generated by $[f_1], \ldots, [f_n]$.
Choose $N > n$, and consider the map $h : X \to C_N$ defined by setting $h(x) = x$ for
$x \in C_N$ and $h(x) = p$ otherwise. Then h is continuous, and the induced homomorphism
$h_* : \pi_1(X, p) \to \pi_1(C_N, p)$ carries each element of $G_n$ to the identity element. On
the other hand, $h \circ g$ is the loop in $C_N$ that is constant outside $[1/(N + 1), 1/N]$ and
on this interval equals the positive linear map of this interval onto $[0, 1]$ followed by $f_N$.
Therefore, $h_*([g]) = [f_N]$, which generates $\pi_1(C_N, p)$. Thus $[g] \notin G_n$.
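
The continuity of g at 0 can be checked numerically. The following is a minimal sketch, not part of the text: the function name `earring_loop` and the particular parametrization chosen for each $f_n$ (one counterclockwise trip around $C_n$ starting at p) are assumptions made for illustration. It shows that on $[1/(n+1), 1/n]$ the point g(t) stays on $C_n$, hence within distance 2/n of p.

```python
import math

def earring_loop(t):
    """Point g(t) of the infinite earring for t in [0, 1], with g(0) = p = (0, 0).

    Assumption: f_n traverses C_n once, starting and ending at the origin.
    """
    if t <= 0.0:
        return (0.0, 0.0)
    n = max(1, math.floor(1.0 / t))            # t lies in [1/(n+1), 1/n]
    lo, hi = 1.0 / (n + 1), 1.0 / n
    # Positive linear map of [1/(n+1), 1/n] onto [0, 1], clamped against rounding.
    s = min(1.0, max(0.0, (t - lo) / (hi - lo)))
    r = 1.0 / n                                # C_n has radius 1/n and center (1/n, 0)
    return (r * (1.0 - math.cos(2 * math.pi * s)), r * math.sin(2 * math.pi * s))

# As t decreases to 0, the bound 2/n on |g(t) - p| decreases to 0 as well.
for t in (0.75, 0.3, 0.01, 0.001):
    x, y = earring_loop(t)
    print(t, math.hypot(x, y), 2.0 / math.floor(1.0 / t))
```

The bound 2/n is just the diameter of $C_n$; this is the estimate behind the remark that continuity follows from the subspace topology of $\mathbb{R}^2$.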


Figure 71.2


In the preceding theorem, we calculated the fundamental group of a space that is 
an infinite wedge of circles. For later use, we now show that such spaces do exist! (We 
shall use this result in Chapter 14.) 


*Lemma 71.4. Given an index set J, there exists a space X that is a wedge of
circles $S_\alpha$ for $\alpha \in J$.


Proof. Give the set J the discrete topology, and let E be the product space $S^1 \times J$.
Choose a point $b_0 \in S^1$, and let X be the quotient space obtained from E by collapsing
the closed set $P = b_0 \times J$ to a point p. Let $\pi : E \to X$ be the quotient map; let
$S_\alpha = \pi(S^1 \times \alpha)$. We show that each $S_\alpha$ is homeomorphic to $S^1$ and X is the wedge of
the circles $S_\alpha$.

Note that if C is closed in $S^1 \times \alpha$, then $\pi(C)$ is closed in X. For $\pi^{-1}\pi(C) = C$
if the point $b_0 \times \alpha$ is not in C, and $\pi^{-1}\pi(C) = C \cup P$ otherwise. In either case,
$\pi^{-1}\pi(C)$ is closed in $S^1 \times J$, so that $\pi(C)$ is closed in X.

It follows that $S_\alpha$ is itself closed in X, since $S^1 \times \alpha$ is closed in $S^1 \times J$, and that
$\pi$ maps $S^1 \times \alpha$ homeomorphically onto $S_\alpha$. Let $\pi_\alpha$ be this homeomorphism.

To show that X has the topology coherent with the subspaces $S_\alpha$, let $D \subset X$ and
suppose that $D \cap S_\alpha$ is closed in $S_\alpha$ for each $\alpha$. Now

$$\pi^{-1}(D) \cap (S^1 \times \alpha) = \pi_\alpha^{-1}(D \cap S_\alpha);$$

the latter set is closed in $S^1 \times \alpha$ because $\pi_\alpha$ is continuous. Then $\pi^{-1}(D)$ is closed in
$S^1 \times J$, so that D is closed in X by definition of the quotient topology. ■
Exercises


1. Let X be a space that is the union of subspaces $S_1, \ldots, S_n$, each of which is
homeomorphic to the unit circle. Assume there is a point p of X such that
$S_i \cap S_j = \{p\}$ for $i \neq j$.
(a) Show that X is Hausdorff if and only if each space $S_i$ is closed in X.
(b) Show that X is Hausdorff if and only if the topology of X is coherent with
the subspaces $S_i$.
(c) Give an example to show that X need not be Hausdorff. [Hint: See Exercise 5
of §36.]

2. Suppose X is a space that is the union of the closed subspaces $X_1, \ldots, X_n$;
assume there is a point p of X such that $X_i \cap X_j = \{p\}$ for $i \neq j$. Then we call
X the wedge of the spaces $X_1, \ldots, X_n$, and write $X = X_1 \vee \cdots \vee X_n$. Show
that if for each i, the point p is a deformation retract of an open set $W_i$ of $X_i$,
then $\pi_1(X, p)$ is the external free product of the groups $\pi_1(X_i, p)$ relative to the
monomorphisms induced by inclusion.

3. What can you say about the fundamental group of $X \vee Y$ if X is homeomorphic
to $S^1$ and Y is homeomorphic to $S^2$?

4. Show that if X is an infinite wedge of circles, then X does not satisfy the first
countability axiom.

5. Let $S_n$ be the circle of radius n in $\mathbb{R}^2$ whose center is at the point $(n, 0)$. Let Y
be the subspace of $\mathbb{R}^2$ that is the union of these circles; let p be their common
point.
(a) Show that Y is not homeomorphic to a countably infinite wedge X of circles,
nor to the space of Example 1.
(b) Show, however, that $\pi_1(Y, p)$ is a free group with $\{[f_n]\}$ as a system of free
generators, where $f_n$ is a loop representing a generator of $\pi_1(S_n, p)$.


§72 Adjoining a Two-cell 


We have computed the fundamental group of the torus $T = S^1 \times S^1$ in two ways. One
involved considering the standard covering map $p \times p : \mathbb{R} \times \mathbb{R} \to S^1 \times S^1$ and using
the lifting correspondence. Another involved a basic theorem about the fundamental
group of a product space. Now we compute the fundamental group of the torus in yet
another way.


If one restricts the covering map $p \times p$ to the unit square, one obtains a quotient
map $\pi : I^2 \to T$. It maps Bd $I^2$ onto the subspace $A = (S^1 \times b_0) \cup (b_0 \times S^1)$, which
is the wedge of two circles, and it maps the rest of $I^2$ bijectively onto $T - A$. Thus, T
can be thought of as the space obtained by pasting the edges of the square $I^2$ onto the
space A.

The process of constructing a space by pasting the edges of a polygonal region
in the plane onto another space is quite useful. We show here how to compute the
fundamental group of such a space. The applications will be many and fruitful.


Theorem 72.1. Let X be a Hausdorff space; let A be a closed path-connected subspace
of X. Suppose that there is a continuous map $h : B^2 \to X$ that maps Int $B^2$
bijectively onto $X - A$ and maps $S^1 = \operatorname{Bd} B^2$ into A. Let $p \in S^1$ and let $a = h(p)$; let
$k : (S^1, p) \to (A, a)$ be the map obtained by restricting h. Then the homomorphism

$$i_* : \pi_1(A, a) \longrightarrow \pi_1(X, a)$$

induced by inclusion is surjective, and its kernel is the least normal subgroup of
$\pi_1(A, a)$ containing the image of $k_* : \pi_1(S^1, p) \to \pi_1(A, a)$.


We sometimes say that the fundamental group of X is obtained from the fundamental
group of A by “killing off” the class $k_*[f]$, where [f] generates $\pi_1(S^1, p)$.


Proof. Step 1. The origin 0 is the center point of $B^2$; let $x_0$ be the point $h(0)$ of X. If
U is the open set $U = X - x_0$ of X, we show that A is a deformation retract of U. See
Figure 72.1.


Figure 72.1


Let $C = h(B^2)$, and let $\pi : B^2 \to C$ be the map obtained by restricting the range
of h. Consider the map

$$\pi \times \mathrm{id} : B^2 \times I \longrightarrow C \times I;$$

it is a closed map because $B^2 \times I$ is compact and $C \times I$ is Hausdorff; therefore, it is a
quotient map. Its restriction

$$\pi' : (B^2 - 0) \times I \longrightarrow (C - x_0) \times I$$

is also a quotient map, since its domain is open in $B^2 \times I$ and is saturated with respect
to $\pi \times \mathrm{id}$. There is a deformation retraction of $B^2 - 0$ onto $S^1$; it induces, via the
quotient map $\pi'$, a deformation retraction of $C - x_0$ onto $\pi(S^1)$. We extend this
deformation retraction to all of $U \times I$ by letting it keep each point of A fixed during
the deformation. Thus A is a deformation retract of U.

It follows that the inclusion of A into U induces an isomorphism of fundamental
groups. Our theorem then reduces to proving the following statement:

Let f be a loop whose class generates $\pi_1(S^1, p)$. Then the inclusion of U into X
induces an epimorphism

$$\pi_1(U, a) \longrightarrow \pi_1(X, a)$$

whose kernel is the least normal subgroup containing the class of the loop $g = h \circ f$.

Step 2. In order to prove this result, it is convenient to consider first the homomorphism
$\pi_1(U, b) \to \pi_1(X, b)$ induced by inclusion relative to a base point b that does
not belong to A.

Let b be any point of $U - A$. Write X as the union of the open sets U and
$V = X - A = \pi(\operatorname{Int} B^2)$. Now U is path connected, since it has A as a deformation
retract. Because $\pi$ is a quotient map, its restriction to Int $B^2$ is also a quotient map
and hence a homeomorphism; thus V is simply connected. The set $U \cap V = V - x_0$
is homeomorphic to Int $B^2 - 0$, so it is path connected and its fundamental group is
infinite cyclic. Since b is a point of $U \cap V$, Corollary 70.4 implies that the homomorphism

$$\pi_1(U, b) \longrightarrow \pi_1(X, b)$$

induced by inclusion is surjective, and its kernel is the least normal subgroup containing
the image of the infinite cyclic group $\pi_1(U \cap V, b)$.

Step 3. Now we change the base point back to a, proving the theorem.

Let q be the point of $B^2$ that is the midpoint of the line segment from 0 to p, and
let $b = h(q)$; then b is a point of $U \cap V$. Let $f_0$ be a loop in Int $B^2 - 0$ based at q
that represents a generator of the fundamental group of this space; then $g_0 = h \circ f_0$
is a loop in $U \cap V$ based at b that represents a generator of the fundamental group of
$U \cap V$. See Figure 72.2.

Step 2 tells us that the homomorphism $\pi_1(U, b) \to \pi_1(X, b)$ induced by inclusion
is surjective and its kernel is the least normal subgroup containing the class of the loop
$g_0 = h \circ f_0$. To obtain the analogous result with base point a we proceed as follows:

Let $\gamma$ be the straight-line path in $B^2$ from q to p; let $\delta$ be the path $\delta = h \circ \gamma$ in U
from b to a. The isomorphisms induced by the path $\delta$ (both of which we denote by $\hat{\delta}$)
commute with the homomorphisms induced by inclusion in the following diagram:

$$\begin{array}{ccc}
\pi_1(U, b) & \longrightarrow & \pi_1(X, b) \\
\downarrow \hat{\delta} & & \downarrow \hat{\delta} \\
\pi_1(U, a) & \longrightarrow & \pi_1(X, a)
\end{array}$$

Therefore, the homomorphism of $\pi_1(U, a)$ into $\pi_1(X, a)$ induced by inclusion is surjective,
and its kernel is the least normal subgroup containing the element $\hat{\delta}([g_0])$.


Figure 72.2


The loop $f_0$ represents a generator of the fundamental group of Int $B^2 - 0$ based
at q. Then the loop $\bar{\gamma} * (f_0 * \gamma)$ represents a generator of the fundamental group of
$B^2 - 0$ based at p. Therefore, it is path homotopic either to f or its reverse; suppose
the former. Following this path homotopy by the map h, we see that $\bar{\delta} * (g_0 * \delta)$ is path
homotopic in U to g. Then $\hat{\delta}([g_0]) = [g]$, and the theorem follows. ■


There is nothing special in this theorem about the unit ball $B^2$. The same result
holds if we replace $B^2$ by any space B homeomorphic to $B^2$, if we denote by Bd B the
subspace corresponding to $S^1$ under the homeomorphism. Such a space B is called a
2-cell. The space X of this theorem is thought of as having been obtained by “adjoining
a 2-cell” to A. We shall treat this situation more formally later.


Exercises 


1. Let X be a Hausdorff space; let A be a closed path-connected subspace. Suppose
that $h : B^n \to X$ is a continuous map that maps $S^{n-1}$ into A and maps Int $B^n$
bijectively onto $X - A$. Let a be a point of $h(S^{n-1})$. If $n > 2$, what can you say
about the homomorphism of $\pi_1(A, a)$ into $\pi_1(X, a)$ induced by inclusion?


2. Let X be the adjunction space formed from the disjoint union of the normal,
path-connected space A and the unit ball $B^2$ by means of a continuous map
$f : S^1 \to A$. (See Exercise 8 of §35.) Show that X satisfies the hypotheses of
Theorem 72.1. Where do you use the fact that A is normal?


3. Let G be a group; let x be an element of G; let N be the least normal subgroup
of G containing x. Show that if there is a normal, path-connected space whose
fundamental group is isomorphic to G, then there is a normal, path-connected
space whose fundamental group is isomorphic to G/N.

§73 The Fundamental Groups of the Torus and the Dunce Cap


We now apply the results of the preceding section to compute two fundamental groups, 
one of which we already know and the other of which we do not. The techniques 
involved will be important later. 


Theorem 73.1. The fundamental group of the torus has a presentation consisting of
two generators $\alpha$, $\beta$ and a single relation $\alpha\beta\alpha^{-1}\beta^{-1}$.


Proof. Let $X = S^1 \times S^1$ be the torus, and let $h : I^2 \to X$ be obtained by restricting
the standard covering map $p \times p : \mathbb{R} \times \mathbb{R} \to S^1 \times S^1$. Let p be the point $(0, 0)$ of
Bd $I^2$, let $a = h(p)$, and let $A = h(\operatorname{Bd} I^2)$. Then the hypotheses of Theorem 72.1 are
satisfied.

The space A is the wedge of two circles, so the fundamental group of A is free.
Indeed, if we let $a_0$ be the path $a_0(t) = (t, 0)$ and $b_0$ be the path $b_0(t) = (0, t)$ in
Bd $I^2$, then the paths $\alpha = h \circ a_0$ and $\beta = h \circ b_0$ are loops in A such that $[\alpha]$ and $[\beta]$
form a system of free generators for $\pi_1(A, a)$. See Figure 73.1.


Figure 73.1


Now let $a_1$ and $b_1$ be the paths $a_1(t) = (t, 1)$ and $b_1(t) = (1, t)$ in Bd $I^2$. Consider
the loop f in Bd $I^2$ defined by the equation

$$f = a_0 * (b_1 * (\bar{a}_1 * \bar{b}_0)).$$

Then f represents a generator of $\pi_1(\operatorname{Bd} I^2, p)$; and the loop $g = h \circ f$ equals the
product $\alpha * (\beta * (\bar{\alpha} * \bar{\beta}))$. Theorem 72.1 tells us that $\pi_1(X, a)$ is the quotient of the
free group on the free generators $[\alpha]$ and $[\beta]$ by the least normal subgroup containing
the element $[\alpha][\beta][\alpha]^{-1}[\beta]^{-1}$. ■


Corollary 73.2. The fundamental group of the torus is a free abelian group of rank 2. 


Proof. Let G be the free group on generators $\alpha$, $\beta$; and let N be the least normal
subgroup containing the element $\alpha\beta\alpha^{-1}\beta^{-1}$. Because this element is a commutator,
N is contained in the commutator subgroup $[G, G]$ of G. On the other hand, G/N
is abelian; for it is generated by the cosets $\alpha N$ and $\beta N$, and these elements of G/N
commute. Therefore N contains the commutator subgroup of G.
It follows from Theorem 69.4 that G/N is a free abelian group of rank 2. ■
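
As a sanity check on the first half of this argument, one can use the standard fact that a word in a free group lies in the commutator subgroup exactly when every generator has total exponent zero. The sketch below is an illustration only; the function name `exponent_sums` and the encoding of words as (generator, exponent) pairs are assumptions, not notation from the text.

```python
from collections import Counter

def exponent_sums(word):
    """Total exponent of each generator in a word.

    `word` is a list of (generator, exponent) pairs; generators whose
    exponents cancel are omitted from the result.
    """
    sums = Counter()
    for gen, exp in word:
        sums[gen] += exp
    return {g: e for g, e in sums.items() if e != 0}

# The relator alpha beta alpha^{-1} beta^{-1} of Theorem 73.1.
torus_relator = [("alpha", 1), ("beta", 1), ("alpha", -1), ("beta", -1)]
print(exponent_sums(torus_relator))   # {} : every exponent sum vanishes
```

An empty result confirms that the relator maps to the identity in the abelianization, which is consistent with N being contained in $[G, G]$.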


Definition. Let n be a positive integer with $n > 1$. Let $r : S^1 \to S^1$ be rotation
through the angle $2\pi/n$, mapping the point $(\cos\theta, \sin\theta)$ to the point
$(\cos(\theta + 2\pi/n), \sin(\theta + 2\pi/n))$. Form a quotient space X from the unit ball $B^2$ by identifying
each point x of $S^1$ with the points $r(x), r^2(x), \ldots, r^{n-1}(x)$. We shall show that X is
a compact Hausdorff space; we call it the n-fold dunce cap.


Let $\pi : B^2 \to X$ be the quotient map; we show that $\pi$ is a closed map. In order
to do this, we must show that if C is a closed set of $B^2$, then $\pi^{-1}\pi(C)$ is also closed
in $B^2$; it then will follow from the definition of the quotient topology that $\pi(C)$ is
closed in X. Let $C_0 = C \cap S^1$; it is closed in $B^2$. The set $\pi^{-1}\pi(C)$ equals the union
of C and the sets $r(C_0), r^2(C_0), \ldots, r^{n-1}(C_0)$, all of which are closed in $B^2$ because
r is a homeomorphism. Hence $\pi^{-1}\pi(C)$ is closed in $B^2$, as desired.

Because $\pi$ is continuous, X is compact. The fact that X is Hausdorff is a consequence
of the following lemma, which was given as an exercise in §31.


Lemma 73.3. Let $\pi : E \to X$ be a closed quotient map. If E is normal, then so
is X.


Proof. Assume E is normal. One-point sets are closed in X because one-point sets
are closed in E. Now let A and B be disjoint closed sets of X. Then $\pi^{-1}(A)$ and
$\pi^{-1}(B)$ are disjoint closed sets of E. Choose disjoint open sets U and V of E
containing $\pi^{-1}(A)$ and $\pi^{-1}(B)$, respectively. It is tempting to assume that $\pi(U)$ and
$\pi(V)$ are the open sets about A and B that we are seeking. But they are not. For they
need not be open ($\pi$ is not necessarily an open map), and they need not be disjoint!
See Figure 73.2.


Figure 73.2


So we proceed as follows: Let $C = E - U$ and let $D = E - V$. Because C and
D are closed sets of E, the sets $\pi(C)$ and $\pi(D)$ are closed in X. Because C contains
no point of $\pi^{-1}(A)$, the set $\pi(C)$ is disjoint from A. Then $U_0 = X - \pi(C)$ is an open
set of X containing A. Similarly, $V_0 = X - \pi(D)$ is an open set of X containing B.
Furthermore, $U_0$ and $V_0$ are disjoint. For if $x \in U_0$, then $\pi^{-1}(x)$ is disjoint from C, so
that it is contained in U. Similarly, if $x \in V_0$, then $\pi^{-1}(x)$ is contained in V. Since U
and V are disjoint, so are $U_0$ and $V_0$. ■


Let us note that the 2-fold dunce cap is a space we have seen before; it is homeomorphic
to the projective plane $P^2$. To verify this fact, recall that $P^2$ was defined
to be the quotient space obtained from $S^2$ by identifying x with $-x$ for each x. Let
$p : S^2 \to P^2$ be the quotient map. Let us take the standard homeomorphism i of $B^2$
with the upper hemisphere of $S^2$, given by the equation

$$i(x, y) = (x, y, (1 - x^2 - y^2)^{1/2}),$$

and follow it by the map p. We obtain a map $\pi : B^2 \to P^2$ that is continuous, closed,
and surjective. On Int $B^2$ it is injective, and for each $x \in S^1$, it maps x and $-x$ to the
same point. Hence it induces a homeomorphism of the 2-fold dunce cap with $P^2$.

The fundamental group of the n-fold dunce cap is just what you might expect from
our computation for $P^2$.


Theorem 73.4. The fundamental group of the n-fold dunce cap is a cyclic group of 
order n. 

Proof. Let $h : B^2 \to X$ be the quotient map, where X is the n-fold dunce cap.
Set $A = h(S^1)$. Let $p = (1, 0) \in S^1$ and let $a = h(p)$. Then h maps the arc C
of $S^1$ running from p to $r(p)$ onto A; it identifies the end points of C but is otherwise
injective. Therefore, A is homeomorphic to a circle, so its fundamental group is infinite
cyclic. Indeed, if $\gamma$ is the path

$$\gamma(t) = (\cos(2\pi t/n), \sin(2\pi t/n))$$

in $S^1$ from p to $r(p)$, then $\alpha = h \circ \gamma$ represents a generator of $\pi_1(A, a)$. See
Figure 73.3.


Figure 73.3


Now the class of the loop

$$f = \gamma * ((r \circ \gamma) * ((r^2 \circ \gamma) * \cdots * (r^{n-1} \circ \gamma)))$$

generates $\pi_1(S^1, p)$. Since $h(r^m(x)) = h(x)$ for all x and m, the loop $h \circ f$ equals the
n-fold product $\alpha * (\alpha * (\cdots * \alpha))$. The theorem follows. ■
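
To spell out the final step, the following display records how Theorem 72.1 is being applied. The symbol N for the kernel is introduced here for illustration and is not notation from the text.

```latex
\begin{align*}
\pi_1(X, a) &\cong \pi_1(A, a)/N,
  \qquad N = \text{least normal subgroup containing } [h \circ f] = [\alpha]^n, \\
\pi_1(A, a) &= \langle [\alpha] \rangle \cong \mathbb{Z}
  \quad\Longrightarrow\quad
  \pi_1(X, a) \cong \mathbb{Z}/n .
\end{align*}
```

Since $\pi_1(A, a)$ is abelian, the least normal subgroup containing $[\alpha]^n$ is simply the subgroup it generates, so the quotient is cyclic of order n.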


Exercises 


1. Find spaces whose fundamental groups are isomorphic to the following groups. 
(Here Z/n denotes the additive group of integers modulo n.) 
(a) $Z/n \times Z/m$.
(b) $Z/n_1 \times Z/n_2 \times \cdots \times Z/n_k$.
(c) $Z/n * Z/m$. (See Exercise 2 of §71.)
(d) $Z/n_1 * Z/n_2 * \cdots * Z/n_k$.

2. Prove the following:
Theorem. If G is a finitely presented group, then there is a compact Hausdorff
space X whose fundamental group is isomorphic to G.
Proof. Suppose G has a presentation consisting of n generators and m relations.
Let A be the wedge of n circles; form an adjunction space X from the union
of A and m copies $B_1, \ldots, B_m$ of the unit ball by means of a continuous map
$f : \bigcup \operatorname{Bd} B_i \to A$.
(a) Show that X is Hausdorff.
(b) Prove the theorem in the case $m = 1$.
(c) Proceed by induction on m, using the algebraic result stated in the following
exercise.
The construction outlined in this exercise is a standard one in algebraic topology;
the space X is called a two-dimensional CW complex.

3. Lemma. Let $f : G \to H$ and $g : H \to K$ be homomorphisms; assume f is
surjective. If $x_0 \in G$, and if ker g is the least normal subgroup of H containing
$f(x_0)$, then $\ker(g \circ f)$ is the least normal subgroup N of G containing ker f
and $x_0$.
Proof. Show that $f(N)$ is normal; conclude that $\ker(g \circ f) = f^{-1}(\ker g) \subset
f^{-1}f(N) = N$.

4. Show that the space constructed in Exercise 2 is in fact metrizable. [Hint: The 
quotient map is a perfect map.] 


Chapter 12 


Classification of Surfaces 


One of the earliest successes of algebraic topology was its role in solving the problem 
of classifying compact surfaces up to homeomorphism. “Solving” this problem means 
giving a list of compact surfaces such that no two surfaces on the list are homeomor- 
phic, and such that every compact surface is homeomorphic to one of them. This is 
the problem we tackle in this chapter. 


§74 Fundamental Groups of Surfaces 


In this section, we show how to construct a number of compact connected surfaces, 
and we compute their fundamental groups. We shall construct each of these surfaces
as the quotient space obtained from a polygonal region in the plane by “pasting its 
edges together.” 

To treat this pasting process formally requires some care. First, let us define
precisely what we shall mean by a “polygonal region in the plane.” Given a point c of $\mathbb{R}^2$,
and given $a > 0$, consider the circle of radius a in $\mathbb{R}^2$ with center at c. Given a finite
sequence $\theta_0 < \theta_1 < \cdots < \theta_n$ of real numbers, where $n \geq 3$ and $\theta_n = \theta_0 + 2\pi$, consider
the points $p_i = c + a(\cos\theta_i, \sin\theta_i)$, which lie on this circle. They are numbered
in counterclockwise order around the circle, and $p_n = p_0$. The line through $p_{i-1}$ and
$p_i$ splits the plane into two closed half-planes; let $H_i$ be the one that contains all the
points $p_k$. Then the space

$$P = H_1 \cap \cdots \cap H_n$$

is called the polygonal region determined by the points $p_i$. The points $p_i$ are called
the vertices of P; the line segment joining $p_{i-1}$ and $p_i$ is called an edge of P; the
union of the edges of P is denoted Bd P; and $P - \operatorname{Bd} P$ is denoted Int P. It is not hard
to show that if p is any point of Int P, then P is the union of all line segments joining
p and points of Bd P, and that two such line segments intersect only in the point p.
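
The half-plane description of P translates directly into a membership test. The following is a minimal sketch under stated assumptions: the function names `polygon_vertices` and `in_polygonal_region` and the tolerance `tol` are hypothetical, and the test relies on the vertices being listed counterclockwise, so that $H_i$ is the closed half-plane to the left of the directed line from $p_{i-1}$ to $p_i$.

```python
import math

def polygon_vertices(c, a, thetas):
    """Vertices p_i = c + a(cos theta_i, sin theta_i) for the given angles."""
    return [(c[0] + a * math.cos(t), c[1] + a * math.sin(t)) for t in thetas]

def in_polygonal_region(x, vertices, tol=1e-12):
    """True if x lies in every half-plane H_i determined by consecutive vertices."""
    for (px, py), (qx, qy) in zip(vertices, vertices[1:]):
        # Cross product of the edge direction with the vector from p_{i-1} to x;
        # it is nonnegative exactly when x is on or to the left of the edge line.
        cross = (qx - px) * (x[1] - py) - (qy - py) * (x[0] - px)
        if cross < -tol:
            return False
    return True

# A square: n = 4, theta_0 = pi/4, theta_4 = theta_0 + 2*pi, radius sqrt(2).
thetas = [math.pi / 4 + k * math.pi / 2 for k in range(5)]
square = polygon_vertices((0.0, 0.0), math.sqrt(2.0), thetas)
print(in_polygonal_region((0.5, -0.9), square))   # True
print(in_polygonal_region((1.5, 0.0), square))    # False
```

The list of vertices repeats $p_0$ at the end as $p_n$, matching the convention $p_n = p_0$ above, so that the last edge is included in the loop.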

Given a line segment L in $\mathbb{R}^2$, an orientation of L is simply an ordering of its end
points; the first, say a, is called the initial point, and the second, say b, is called the
final point, of the oriented line segment. We often say that L is oriented from a to b;
and we picture the orientation by drawing an arrow on L that points from a towards b.
If L′ is another line segment, oriented from c to d, then the positive linear map of L
onto L′ is the homeomorphism h that carries the point $x = (1 - s)a + sb$ of L to the
point $h(x) = (1 - s)c + sd$ of L′.

If two polygonal regions P and Q have the same number of vertices, $p_0, \ldots, p_n$
and $q_0, \ldots, q_n$, respectively, with $p_0 = p_n$ and $q_0 = q_n$, then there is an obvious
homeomorphism h of Bd P with Bd Q that carries the line segment from $p_{i-1}$ to $p_i$
by a positive linear map onto the line segment from $q_{i-1}$ to $q_i$. If p and q are fixed
points of Int P and Int Q, respectively, then this homeomorphism may be extended to a
homeomorphism of P with Q by letting it map the line segment from p to the point x
of Bd P linearly onto the line segment from q to $h(x)$. See Figure 74.1.


Figure 74.1


Definition. Let P be a polygonal region in the plane. A labelling of the edges of P is
a map from the set of edges of P to a set S called the set of labels. Given an orientation
of each edge of P, and given a labelling of the edges of P, we define an equivalence
relation on the points of P as follows: Each point of Int P is equivalent only to itself.
Given any two edges of P that have the same label, let h be the positive linear map
of one onto the other, and define each point x of the first edge to be equivalent to
the point $h(x)$ of the second edge. This relation generates an equivalence relation on
P. The quotient space X obtained from this equivalence relation is said to have been
obtained by pasting the edges of P together according to the given orientations and
labelling.


EXAMPLE 1. Consider the orientations and labelling of the edges of the triangular region
pictured in Figure 74.2. The figure indicates how one can show that the resulting quotient
space is homeomorphic to the unit ball.


Figure 74.2


EXAMPLE 2. The orientations and labelling of the edges of the square pictured in
Figure 74.3 give rise to a space that is homeomorphic to the sphere $S^2$.


Figure 74.3


We now describe a convenient method for specifying orientations and labels for 
the edges of a polygonal region, a method that does not involve drawing a picture. 


Definition. Let P be a polygonal region with successive vertices $p_0, \ldots, p_n$, where
$p_0 = p_n$. Given orientations and a labelling of the edges of P, let $a_1, \ldots, a_m$ be
the distinct labels that are assigned to the edges of P. For each k, let $a_{i_k}$ be the label
assigned to the edge $p_{k-1}p_k$, and let $\epsilon_k = +1$ or $-1$ according as the orientation
assigned to this edge goes from $p_{k-1}$ to $p_k$ or the reverse. Then the number of edges
of P, the orientations of the edges, and the labelling are completely specified by the
symbol

$$w = (a_{i_1})^{\epsilon_1} (a_{i_2})^{\epsilon_2} \cdots (a_{i_n})^{\epsilon_n}.$$

We call this symbol a labelling scheme of length n for the edges of P; it is simply a
sequence of labels with exponents $+1$ or $-1$.


We normally omit the exponents that equal $+1$ when giving a labelling scheme.
Then the orientations and labelling of Example 1 can be specified by the labelling
scheme $a^{-1}ba$, if we take $p_0$ to be the top vertex of the triangle. If we take one of the
other vertices to be $p_0$, then we obtain one of the labelling schemes $baa^{-1}$ or $aa^{-1}b$.

Similarly, the orientations and labelling indicated in Example 2 can be specified
(if we begin at the lower left corner of the square) by the symbol $aa^{-1}bb^{-1}$.

It is clear that a cyclic permutation of the terms in a labelling scheme will change
the space X formed by using the scheme only up to homeomorphism. Later we will
consider other modifications one can make to a labelling scheme that will leave the
space X unchanged up to homeomorphism.


EXAMPLE 3. We have already shown how the torus can be expressed as a quotient
space of the unit square by means of the quotient map $p \times p : I \times I \to S^1 \times S^1$. This
same quotient space can be specified by the orientations and labelling of the edges of the
square indicated in Figure 74.4. It can be specified also by the scheme $aba^{-1}b^{-1}$.


Figure 74.4


EXAMPLE 4. The projective plane $P^2$ is homeomorphic to the quotient space of the
unit ball $B^2$ obtained by identifying x with $-x$ for each $x \in S^1$. Because the unit square
is homeomorphic to the unit ball, this space can also be specified by the orientations and
labelling of the edges of the unit square indicated in Figure 74.5. It can be specified by the
scheme abab.


Figure 74.5

Now there is no reason to restrict oneself to a single polygonal region when forming
a space by pasting edges together. Given a finite number $P_1, \ldots, P_k$ of disjoint
polygonal regions, along with orientations and a labelling of their edges, one can form
a quotient space X in exactly the same way as for a single region, by pasting the edges
of these regions together. Also, one specifies orientations and a labelling in a similar
way, by means of k labelling schemes. Depending on the particular schemes, the
space X one obtains may or may not be connected.


EXAMPLE 5. Figure 74.6 indicates a labelling of the edges of two squares for which the
resulting quotient space is connected; it is the space called the Möbius band. Of course,
this space could also be obtained from a single square by using the labelling scheme abac.


EXAMPLE 6. Figure 74.7 indicates a labelling scheme for the edges of two squares for
which the resulting quotient space is not connected.


Figure 74.7


Theorem 74.1. Let X be the space obtained from a finite collection of polygonal 
regions by pasting edges together according to some labelling scheme. Then X is a 
compact Hausdorff space. 


Proof. For simplicity, we treat the case where X is formed from a single polygonal
region. The general case is similar.

It is immediate that X is compact, since the quotient map is continuous. To
show X is Hausdorff, it suffices to show that the quotient map $\pi$ is a closed map.
(See Lemma 73.3.) For this purpose, we must show that for each closed set C of P,
the set $\pi^{-1}\pi(C)$ is closed in P. Now $\pi^{-1}\pi(C)$ consists of the points of C and all
points of P that are pasted to points of C by the map $\pi$. These points are easy to
determine. For each edge e of P, let $C_e$ denote the compact subspace $C \cap e$ of P. If $e_i$
is an edge of P that is pasted to e, and if $h_i : e_i \to e$ is the pasting homeomorphism,
then the set $D_e = \pi^{-1}\pi(C) \cap e$ contains the space $h_i(C_{e_i})$. Indeed, $D_e$ equals the
union of $C_e$ and the spaces $h_i(C_{e_i})$, as $e_i$ ranges over all edges of P that are pasted
to e. This union is compact; therefore, it is closed in e and in P.

Since $\pi^{-1}\pi(C)$ is the union of the set C and the sets $D_e$, as e ranges over all edges
of P, it is closed in P, as desired. ■


Now we note that if X is obtained by pasting the edges of a polygonal region
together, the quotient map $\pi$ may map all the vertices of the polygonal region to a
single point of X, or it may not. In the case of the torus of Example 3, the quotient
map does satisfy this condition, while in the case of the ball and sphere of Examples 1
and 2, it does not. We are especially happy when $\pi$ satisfies this condition, for in this
case one can readily compute the fundamental group of X:


Theorem 74.2. Let P be a polygonal region; let

$$w = (a_{i_1})^{\epsilon_1} \cdots (a_{i_n})^{\epsilon_n}$$

be a labelling scheme for the edges of P. Let X be the resulting quotient space; let
$\pi : P \to X$ be the quotient map. If $\pi$ maps all the vertices of P to a single point $x_0$
of X, and if $a_1, \ldots, a_k$ are the distinct labels that appear in the labelling scheme, then
$\pi_1(X, x_0)$ is isomorphic to the quotient of the free group on k generators $\alpha_1, \ldots, \alpha_k$
by the least normal subgroup containing the element

$$(\alpha_{i_1})^{\epsilon_1} \cdots (\alpha_{i_n})^{\epsilon_n}.$$


Proof. The proof is similar to the proof we gave for the torus in §73. Because $\pi$
maps all vertices of P to a single point of X, the space $A = \pi(\operatorname{Bd} P)$ is a wedge
of k circles. For each i, choose an edge of P that is labelled $a_i$; let $f_i$ be the positive
linear map of I onto this edge oriented counterclockwise; and let $g_i = \pi \circ f_i$. Then the
loops $g_1, \ldots, g_k$ represent a set of free generators for $\pi_1(A, x_0)$. The loop f running
around Bd P once in the counterclockwise direction generates the fundamental group
of Bd P, and the loop $\pi \circ f$ equals the loop

$$(g_{i_1})^{\epsilon_1} * \cdots * (g_{i_n})^{\epsilon_n}.$$

The theorem now follows from Theorem 72.1. ■
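
The hypothesis of Theorem 74.2, that all vertices of P map to a single point, can be checked mechanically from a labelling scheme. The sketch below is one way to do so and is not taken from the text: the function name `vertex_classes` and the encoding of a scheme as a list of (label, exponent) pairs are assumptions. It uses the fact that pasting two edges with the same label identifies their initial points with each other and their final points with each other.

```python
def vertex_classes(scheme):
    """Number of equivalence classes of vertices under a labelling scheme.

    `scheme` is a list of (label, exponent) pairs with exponent +1 or -1.
    Using 0-based indices, the k-th pair describes the edge from p_k to
    p_{k+1}, where p_n = p_0.
    """
    n = len(scheme)
    parent = list(range(n))

    def find(v):
        # Root of v's class, with path halving.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    def union(u, v):
        parent[find(u)] = find(v)

    first_seen = {}
    for k, (label, eps) in enumerate(scheme):
        tail, head = k, (k + 1) % n
        # Exponent -1 reverses the orientation of the edge.
        initial, final = (tail, head) if eps == 1 else (head, tail)
        if label in first_seen:
            i0, f0 = first_seen[label]
            union(initial, i0)
            union(final, f0)
        else:
            first_seen[label] = (initial, final)
    return len({find(v) for v in range(n)})

torus = [("a", 1), ("b", 1), ("a", -1), ("b", -1)]    # a b a^{-1} b^{-1}
sphere = [("a", 1), ("a", -1), ("b", 1), ("b", -1)]   # a a^{-1} b b^{-1}
ball = [("a", -1), ("b", 1), ("a", 1)]                # a^{-1} b a
print(vertex_classes(torus), vertex_classes(sphere), vertex_classes(ball))  # 1 3 2
```

Only the torus scheme yields a single class, in agreement with the remark preceding the theorem that the ball and sphere of Examples 1 and 2 do not satisfy the condition.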

Definition. Consider the space obtained from a 4n-sided polygonal region P by
means of the labelling scheme

$$(a_1 b_1 a_1^{-1} b_1^{-1})(a_2 b_2 a_2^{-1} b_2^{-1}) \cdots (a_n b_n a_n^{-1} b_n^{-1}).$$

This space is called the n-fold connected sum of tori, or simply the n-fold torus, and
denoted $T \# \cdots \# T$.

The 2-fold torus is pictured in Figure 74.8. If we split the polygonal region P
along the indicated line c, each of the resulting pieces represents a torus with an open
disc removed. If we paste these pieces together along the curve c, we obtain the space
we introduced in §60 and called there the double torus. A similar argument shows that
the 3-fold torus $T \# T \# T$ can be pictured as the surface in Figure 74.9.


Figure 74.8

Figure 74.9


Theorem 74.3. Let X denote the n-fold torus. Then $\pi_1(X, x_0)$ is isomorphic to the
quotient of the free group on the 2n generators $\alpha_1, \beta_1, \ldots, \alpha_n, \beta_n$ by the least normal
subgroup containing the element

$$[\alpha_1, \beta_1][\alpha_2, \beta_2] \cdots [\alpha_n, \beta_n],$$

where $[\alpha, \beta] = \alpha\beta\alpha^{-1}\beta^{-1}$, as usual.


Proof. In order to apply Theorem 74.2, one must show that under the labelling
scheme for X, all the vertices of the polygonal region belong to the same equivalence
class. We leave this to you to check. ■


Definition. Let $m > 1$. Consider the space obtained from a 2m-sided polygonal
region P in the plane by means of the labelling scheme

$$(a_1 a_1)(a_2 a_2) \cdots (a_m a_m).$$

This space is called the m-fold connected sum of projective planes, or simply the
m-fold projective plane, and denoted $P^2 \# \cdots \# P^2$.


The 2-fold projective plane $P^2 \# P^2$ is pictured in Figure 74.10. The figure
indicates how this space can be obtained from two copies of the projective plane by
deleting an open disc from each and pasting the resulting spaces together along the
boundaries of the deleted discs. As with $P^2$ itself, we have no convenient way for
picturing the m-fold projective plane as a surface in $\mathbb{R}^3$, for in fact it cannot be imbedded
in $\mathbb{R}^3$. Sometimes, however, we can picture it in $\mathbb{R}^3$ as a surface that intersects itself.
(We then speak of an immersed surface rather than an imbedded one.) We explore this
topic in the exercises.


Figure 74.10


Theorem 74.4. Let X denote the m-fold projective plane. Then $\pi_1(X, x_0)$ is isomorphic
to the quotient of the free group on m generators $\alpha_1, \ldots, \alpha_m$ by the least normal
subgroup containing the element

$$(\alpha_1)^2 (\alpha_2)^2 \cdots (\alpha_m)^2.$$


Proof. One needs only to check that under the labelling scheme for X, all the vertices
of the polygonal region belong to the same equivalence class. This we leave to you. ■
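
For the reader who wants to see the shape of this check, here is the case $m = 2$ written out, with vertices $p_0, \ldots, p_4 = p_0$ in counterclockwise order. The symbol $\sim$ for the equivalence relation on vertices is introduced here for illustration.

```latex
\begin{align*}
&\text{Scheme } a_1 a_1 a_2 a_2: \quad
  p_0 p_1 \mapsto a_1,\quad p_1 p_2 \mapsto a_1,\quad
  p_2 p_3 \mapsto a_2,\quad p_3 p_0 \mapsto a_2
  \quad (\text{all exponents } +1). \\
&\text{Label } a_1:\ \text{initial points } p_0 \sim p_1,\quad
  \text{final points } p_1 \sim p_2. \\
&\text{Label } a_2:\ \text{initial points } p_2 \sim p_3,\quad
  \text{final points } p_3 \sim p_0. \\
&\text{Hence } p_0 \sim p_1 \sim p_2 \sim p_3 .
\end{align*}
```

The same pattern applies for general m: the two edges labelled $a_j$ are $p_{2j-2}p_{2j-1}$ and $p_{2j-1}p_{2j}$, which gives $p_{2j-2} \sim p_{2j-1} \sim p_{2j}$, and these chains link all the vertices together.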


There exist many other ways to form compact surfaces. One can for instance delete
an open disc from each of the spaces $P^2$ and T, and paste the resulting spaces together
along the boundaries of the deleted discs. You can check that this space can be obtained
from a 6-sided polygonal region by means of the labelling scheme $aabcb^{-1}c^{-1}$. But
we shall stop at this point. For it turns out that we have already obtained a complete
list of the compact connected surfaces. This is the basic classification theorem for
surfaces, which we shall consider shortly.


Exercises 


1. Find a presentation for the fundamental group of $P^2 \# T$.


2. Consider the space X obtained from a seven-sided polygonal region by means of
the labelling scheme $abaaab^{-1}a^{-1}$. Show that the fundamental group of X is
the free product of two cyclic groups. [Hint: See Theorem 68.7.]

3. The Klein bottle K is the space obtained from a square by means of the labelling
scheme $aba^{-1}b$. Figure 74.11 indicates how K can be pictured as an immersed
surface in $\mathbb{R}^3$.

(a) Find a presentation for the fundamental group of K.

(b) Find a double covering map $p : T \to K$, where T is the torus. Describe the
induced homomorphism of fundamental groups.


Figure 74.11 


4. (a) Show that the Klein bottle is homeomorphic to $P^2 \# P^2$. [Hint: Split the
square in Figure 74.11 along a diagonal, flip one of the resulting triangular
pieces over, and paste the two pieces together along the edge labelled b.]

(b) Show how to picture the 4-fold projective plane as an immersed surface
in $\mathbb{R}^3$.


5. The Möbius band M is not a surface, but what is called a “surface with boundary.”
Show that M is homeomorphic to the space obtained by deleting an open
disc from $P^2$.


6. If n > 1, show that the fundamental group of the n-fold torus is not abelian.
[Hint: Let G be the free group on the set $\{\alpha_1, \beta_1, \ldots, \alpha_n, \beta_n\}$; let F be the free
group on the set $\{\gamma, \delta\}$. Consider the homomorphism of G onto F that sends $\alpha_1$
and $\beta_1$ to $\gamma$ and all other $\alpha_i$ and $\beta_i$ to $\delta$.]


7. If m > 1, show the fundamental group of the m-fold projective plane is not
abelian. [Hint: There is a homomorphism mapping this group onto the group
$\mathbb{Z}/2 * \mathbb{Z}/2$.]


§75 Homology of Surfaces 


Although we have succeeded in obtaining presentations for the fundamental groups of
a number of surfaces, we now pause to ask ourselves what we have actually accomplished.
Can we conclude from our computations, for instance, that the double torus
and the triple torus are topologically distinct? Not immediately. For, as we know,
we lack an effective procedure for determining from the presentations for two groups
whether or not these groups are isomorphic. Matters are much more satisfactory if we
pass to the abelian group $\pi_1/[\pi_1, \pi_1]$, where $\pi_1 = \pi_1(X, x_0)$. For then we have some
known invariants to work with. We explore this situation in this section.

We know that if X is a path-connected space, and if $\alpha$ is a path in X from $x_0$
to $x_1$, then there is an isomorphism $\hat{\alpha}$ of the fundamental group based at $x_0$ with the
fundamental group based at $x_1$, but the isomorphism depends on the choice of the path
$\alpha$. A stronger result holds for the group $\pi_1/[\pi_1, \pi_1]$. In this case, the isomorphism of
the “abelianized fundamental group” based at $x_0$ with the one based at $x_1$, induced by
$\alpha$, is in fact independent of the choice of the path $\alpha$.

To verify this fact, it suffices to show that if $\alpha$ and $\beta$ are two paths from $x_0$ to $x_1$,
then the path $g = \alpha * \bar{\beta}$ induces the identity isomorphism of $\pi_1/[\pi_1, \pi_1]$ with itself.
And this is easy. If $[f] \in \pi_1(X, x_0)$, we have

$$\hat{g}[f] = [\bar{g} * f * g] = [g]^{-1} * [f] * [g].$$

When we pass to the cosets in the abelian group $\pi_1/[\pi_1, \pi_1]$, we see that $\hat{g}$ induces the
identity map.


Definition. If X is a path-connected space, let

$$H_1(X) = \pi_1(X, x_0)/[\pi_1(X, x_0), \pi_1(X, x_0)].$$

We call $H_1(X)$ the first homology group of X. We omit the base point from the
notation because there is a unique path-induced isomorphism between the abelianized
fundamental groups based at two different points.


If you study algebraic topology further, you will see an entirely different definition
of $H_1(X)$. In fact, you will see groups $H_n(X)$ called the homology groups of X
that are defined for all $n \geq 0$. These are abelian groups that are topological invariants
of X; they are of fundamental importance in applying results of algebra to problems
of topology. A theorem due to W. Hurewicz establishes a connection between these
groups and the homotopy groups of X. It implies in particular that for a path-connected
space X, the first homology group $H_1(X)$ of X is isomorphic to the abelianized fundamental
group of X. This theorem motivates our choice of notation for the abelianized
fundamental group.

To compute $H_1(X)$ for the surfaces considered earlier, we need the following result:


Theorem 75.1. Let F be a group; let N be a normal subgroup of F; let $q : F \to F/N$
be the projection. The projection homomorphism

$$p : F \to F/[F, F]$$

induces an isomorphism

$$\phi : q(F)/[q(F), q(F)] \to p(F)/p(N).$$

This theorem states, roughly speaking, that if one divides F by N and then abelianizes
the quotient, one obtains the same result as if one first abelianizes F and then
divides by the image of N in this abelianization.


Proof. One has projection homomorphisms p, q, r, s, as in the following diagram,
where $q(F) = F/N$ and $p(F) = F/[F, F]$.

$$
\begin{array}{ccccc}
F & \xrightarrow{\ q\ } & q(F) & \xrightarrow{\ s\ } & q(F)/[q(F), q(F)] \\
{\scriptstyle p}\Big\downarrow & & & & {\scriptstyle \phi}\Big\downarrow\Big\uparrow{\scriptstyle \psi} \\
p(F) & & \xrightarrow{\qquad r \qquad} & & p(F)/p(N)
\end{array}
$$

Because $r \circ p$ maps N to 1, it induces a homomorphism $u : q(F) \to p(F)/p(N)$.
Then because $p(F)/p(N)$ is abelian, the homomorphism u induces a homomorphism
$\phi$ of $q(F)/[q(F), q(F)]$. On the other hand, because $s \circ q$ maps F into an abelian
group, it induces a homomorphism $v : p(F) \to q(F)/[q(F), q(F)]$. Because $s \circ q$
carries N to 1, so does $v \circ p$; hence v induces a homomorphism $\psi$ of $p(F)/p(N)$.
The homomorphism $\phi$ can be described as follows: Given an element y of the
group $q(F)/[q(F), q(F)]$, choose an element x of F such that $s(q(x)) = y$; then
$\phi(y) = r(p(x))$. The homomorphism $\psi$ can be described similarly. It follows that $\phi$
and $\psi$ are inverse to each other. ∎


Corollary 75.2. Let F be a free group with free generators $\alpha_1, \ldots, \alpha_n$; let N be
the least normal subgroup of F containing the element x of F; let $G = F/N$. Let
$p : F \to F/[F, F]$ be projection. Then $G/[G, G]$ is isomorphic to the quotient
of $F/[F, F]$, which is free abelian with basis $p(\alpha_1), \ldots, p(\alpha_n)$, by the subgroup
generated by $p(x)$.

Proof. Note that because N is generated by x and all its conjugates, the group $p(N)$
is generated by $p(x)$. The corollary then follows from the preceding theorem. ∎


Theorem 75.3. If X is the n-fold connected sum of tori, then $H_1(X)$ is a free abelian
group of rank 2n.

Proof. In view of the preceding corollary, Theorem 74.3 implies that $H_1(X)$ is isomorphic
to the quotient of the free abelian group $F'$ on the set $\alpha_1, \beta_1, \ldots, \alpha_n, \beta_n$ by the
subgroup generated by the element $[\alpha_1, \beta_1] \cdots [\alpha_n, \beta_n]$, where $[\alpha, \beta] = \alpha\beta\alpha^{-1}\beta^{-1}$
as usual. Because the group $F'$ is abelian, this element equals the identity element. ∎


Theorem 75.4. If X is the m-fold connected sum of projective planes, then the torsion
subgroup $T(X)$ of $H_1(X)$ has order 2, and $H_1(X)/T(X)$ is a free abelian group of rank
$m - 1$.

Proof. In view of the preceding corollary, Theorem 74.4 implies that $H_1(X)$ is isomorphic
to the quotient of the free abelian group $F'$ on the set $\alpha_1, \ldots, \alpha_m$ by the
subgroup generated by $(\alpha_1)^2 \cdots (\alpha_m)^2$. If we switch to additive notation (which is
usual when dealing with abelian groups), this is the subgroup generated by the element
$2(\alpha_1 + \cdots + \alpha_m)$. Let us change bases in the group $F'$. If we let $\beta = \alpha_1 + \cdots + \alpha_m$, then
the elements $\alpha_1, \ldots, \alpha_{m-1}, \beta$ form a basis for $F'$; any element of $F'$ can be written
uniquely in terms of these elements. The group $H_1(X)$ is isomorphic to the quotient
of the free abelian group on $\alpha_1, \ldots, \alpha_{m-1}, \beta$ by the subgroup generated by $2\beta$.
Said differently, $H_1(X)$ is isomorphic to the quotient of the m-fold cartesian product
$\mathbb{Z} \times \cdots \times \mathbb{Z}$ by the subgroup $0 \times \cdots \times 0 \times 2\mathbb{Z}$. The theorem follows. ∎
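
The computations in Theorems 75.3 and 75.4 follow one pattern, which can be sketched in Python. By Corollary 75.2, $H_1(X)$ is the quotient of a free abelian group by the subgroup generated by a single element, namely the vector of exponent sums of the labels in the scheme. The sketch assumes that the scheme is for a single polygonal region all of whose vertices are identified, so that Theorem 74.2 applies. It also uses the standard fact, a generalization of the change of basis in the proof above, that $\mathbb{Z}^n/\langle v \rangle$ has rank $n - 1$ and cyclic torsion of order $\gcd(v)$ when $v \neq 0$. The (label, exponent) encoding and the function name are assumptions of the sketch.

```python
from functools import reduce
from math import gcd


def first_homology(scheme):
    """Return (rank, torsion_order) of Z^n / <v>.

    `scheme` is a list of (label, exponent) pairs; v is the vector of
    exponent sums, one entry per distinct label. A torsion order of 1
    means the group is free abelian.
    """
    sums = {}
    for label, exp in scheme:
        sums[label] = sums.get(label, 0) + exp
    n = len(sums)
    d = reduce(gcd, (abs(s) for s in sums.values()), 0)
    if d == 0:
        return n, 1        # v = 0: nothing is divided out
    return n - 1, d        # v = d * (primitive vector): rank drops by one


double_torus = [("a", 1), ("b", 1), ("a", -1), ("b", -1),
                ("c", 1), ("d", 1), ("c", -1), ("d", -1)]
triple_projective_plane = [("a", 1), ("a", 1), ("b", 1), ("b", 1), ("c", 1), ("c", 1)]

print(first_homology(double_torus))             # rank 4, no torsion
print(first_homology(triple_projective_plane))  # rank 2, torsion of order 2
```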


Theorem 75.5. Let $T_n$ and $P_m$ denote the n-fold connected sum of tori and the m-fold
connected sum of projective planes, respectively. Then the surfaces $S^2$; $T_1, T_2,
\ldots$; $P_1, P_2, \ldots$ are topologically distinct.


Exercises 


1. Calculate $H_1(P^2 \# T)$. Assuming that the list of compact surfaces given in Theorem 75.5
is a complete list, to which of these surfaces is $P^2 \# T$ homeomorphic?

2. If K is the Klein bottle, calculate $H_1(K)$ directly.

3. Let X be the quotient space obtained from an 8-sided polygonal region P by
pasting its edges together according to the labelling scheme $acadbcb^{-1}d$.
(a) Check that all vertices of P are mapped to the same point of the quotient
space X by the pasting map.
(b) Calculate $H_1(X)$.
(c) Assuming X is homeomorphic to one of the surfaces given in Theorem 75.5
(which it is), which surface is it?

*4. Let X be the quotient space obtained from an 8-sided polygonal region P by
means of the labelling scheme $abcdad^{-1}cb^{-1}$. Let $\pi : P \to X$ be the quotient
map.

(a) Show that $\pi$ does not map all the vertices of P to the same point of X.

(b) Determine the space $A = \pi(\operatorname{Bd} P)$ and calculate its fundamental group.

(c) Calculate $\pi_1(X, x_0)$ and $H_1(X)$.

(d) Assuming X is homeomorphic to one of the surfaces given in Theorem 75.5,
which surface is it?


§76 Cutting and Pasting 


To prove the classification theorem, we need to use certain geometric arguments
involving what are called “cut-and-paste” techniques. These techniques show how to
take a space X that is obtained by pasting together the edges of one or more polygonal
regions according to some labelling scheme and to represent X by a different collection
of polygonal regions and a different labelling scheme.

First, let us consider what it means to “cut apart” a polygonal region. Let P be
a polygonal region with successive vertices $p_0, \ldots, p_n = p_0$, as usual. Given k with
$1 < k < n - 1$, let us consider the polygonal regions $Q_1$, with successive vertices
$p_0, p_1, \ldots, p_k, p_0$, and $Q_2$, with successive vertices $p_0, p_k, \ldots, p_n = p_0$. These
regions have the edge $p_0 p_k$ in common, and the region P is their union.

Let us move $Q_1$ by a translation of $\mathbb{R}^2$ so as to obtain a polygonal region $Q_1'$
that is disjoint from $Q_2$; then $Q_1'$ has successive vertices $q_0, q_1, \ldots, q_k, q_0$, where $q_i$
is the image of $p_i$ under the translation. The regions $Q_1'$ and $Q_2$ are said to have
been obtained by cutting P apart along the line from $p_0$ to $p_k$. The region P is
homeomorphic to the quotient space of $Q_1'$ and $Q_2$ obtained by pasting the edge of $Q_1'$
going from $q_0$ to $q_k$ to the edge of $Q_2$ going from $p_0$ to $p_k$, by the positive linear map
of one edge onto the other. See Figure 76.1.




Figure 76.1 


Now let us consider how we can reverse this process. Suppose we are given two
disjoint polygonal regions $Q_1'$, with successive vertices $q_0, \ldots, q_k, q_0$, and $Q_2$, with
successive vertices $p_0, p_k, \ldots, p_n = p_0$. And suppose we form a quotient space by
pasting the edge of $Q_1'$ from $q_0$ to $q_k$ onto the edge of $Q_2$ from $p_0$ to $p_k$, by the positive
linear map of one edge onto the other. We wish to represent this space by a polygonal
region P.

This task is accomplished as follows: The vertices of $Q_2$ lie on a circle and are
arranged in counterclockwise fashion. Let us choose points $p_1, \ldots, p_{k-1}$ on this
same circle in such a way that $p_0, p_1, \ldots, p_{k-1}, p_k$ are arranged in counterclockwise
order, and let $Q_1$ be the polygonal region with these as successive vertices. There is a
homeomorphism of $Q_1'$ onto $Q_1$ that carries $q_i$ to $p_i$ for each i and maps the edge $q_0 q_k$
of $Q_1'$ linearly onto the edge $p_0 p_k$ of $Q_2$. Therefore, the quotient space in question
is homeomorphic to the region P that is the union of $Q_1$ and $Q_2$. We say that P is
obtained by pasting $Q_1'$ and $Q_2$ together along the indicated edges. See Figure 76.2.

Now we ask the following question: If a polygonal region has a labelling scheme,
what effect does cutting the region apart have on this labelling scheme? More precisely,
suppose we have a collection of disjoint polygonal regions $P_1, \ldots, P_m$ and a
labelling scheme for these regions, say $w_1, \ldots, w_m$, where $w_i$ is a labelling scheme
for the edges of $P_i$. Suppose that X is the quotient space obtained from this labelling
scheme. If we cut $P_1$ apart along the line from $p_0$ to $p_k$, what happens? We obtain
m + 1 polygonal regions $Q_1', Q_2, P_2, \ldots, P_m$; to obtain the space X from these regions,
we need one additional edge pasting. We indicate the additional pasting that is
required by introducing a new label that is to be assigned to the edges $q_0 q_k$ and $p_0 p_k$
that we introduced. Because the orientation from $p_0$ to $p_k$ is counterclockwise for $Q_2$,
and the orientation from $q_0$ to $q_k$ is clockwise for $Q_1'$, this label will have exponent +1
when it appears in the scheme for $Q_2$ and exponent −1 when it appears in the scheme
for $Q_1'$.

Let us be more specific. We can write the labelling scheme $w_1$ for $P_1$ in the
form $w_1 = y_0 y_1$, where $y_0$ consists of the first k terms of $w_1$ and $y_1$ consists of the
remainder. Let c be a label that does not appear in any of the schemes $w_1, \ldots, w_m$.
Then give $Q_1'$ the labelling scheme $y_0 c^{-1}$, give $Q_2$ the labelling scheme $c y_1$, and for
i > 1 give the region $P_i$ its old scheme $w_i$.

It is immediate that the space X can be obtained from the regions $Q_1', Q_2, P_2,
\ldots, P_m$ by means of this labelling scheme. For the composite of quotient maps is a
quotient map, so it does not matter whether we paste all the edges together at once, or
instead paste the edge $p_0 p_k$ to the edge $q_0 q_k$ before pasting the others.

One can of course apply this procedure in reverse. If X is represented by a labelling
scheme for the regions $Q_1', Q_2, P_2, \ldots, P_m$ and if the labelling scheme indicates that
an edge of the first is to be pasted to an edge of the second (and no other edge is to
be pasted to these), we can actually carry out the pasting so as to represent X by a
labelling scheme for the m regions $P_1, \ldots, P_m$.

We state this fact formally as a theorem: 


Theorem 76.1. Suppose X is the space obtained by pasting the edges of m polygonal
regions together according to the labelling scheme

$$(*) \qquad y_0 y_1, w_2, \ldots, w_m.$$

Let c be a label not appearing in this scheme. If both $y_0$ and $y_1$ have length at least
two, then X can also be obtained by pasting the edges of m + 1 polygonal regions
together according to the scheme

$$(**) \qquad y_0 c^{-1}, c y_1, w_2, \ldots, w_m.$$

Conversely, if X is the space obtained from m + 1 polygonal regions by means of
the scheme (**), it can also be obtained from m polygonal regions by means of the
scheme (*), providing that c does not appear in scheme (*).


Elementary operations on schemes 


We now list a number of elementary operations that can be performed on a labelling
scheme $w_1, \ldots, w_m$ without affecting the resulting quotient space X. The first two
arise from the theorem just stated.

(i) Cut. One can replace the scheme $w_1 = y_0 y_1$ by the scheme $y_0 c^{-1}$ and $c y_1$,
provided c does not appear elsewhere in the total scheme and $y_0$ and $y_1$ have length at
least two.

(ii) Paste. One can replace the scheme $y_0 c^{-1}$ and $c y_1$ by the scheme $y_0 y_1$,
provided c does not appear elsewhere in the total scheme.

(iii) Relabel. One can replace all occurrences of any given label by some other 
label that does not appear elsewhere in the scheme. Similarly, one can change the 
sign of the exponent of all occurrences of a given label a; this amounts to reversing 
the orientations of all the edges labelled “a”. Neither of these alterations affects the 
pasting map. 

(iv) Permute. One can replace any one of the schemes $w_i$ by a cyclic permutation
of $w_i$. Specifically, if $w_i = y_0 y_1$, we can replace $w_i$ by $y_1 y_0$. This amounts to
renumbering the vertices of the polygonal region $P_i$ so as to begin with a different vertex; it
does not affect the resulting quotient space.

(v) Flip. One can replace the scheme

$$w_i = (a_{i_1})^{\epsilon_1} \cdots (a_{i_n})^{\epsilon_n}$$

by its formal inverse

$$w_i^{-1} = (a_{i_n})^{-\epsilon_n} \cdots (a_{i_1})^{-\epsilon_1}.$$

This amounts simply to “flipping the polygonal region $P_i$ over.” The order of the
vertices is reversed, and so is the orientation of each edge. The quotient space X is not
affected.

(vi) Cancel. One can replace the scheme $w_i = y_0 a a^{-1} y_1$ by the scheme $y_0 y_1$,
provided a does not appear elsewhere in the total scheme and both $y_0$ and $y_1$ have
length at least two.

This last result follows from the three-step argument indicated in Figure 76.3, only
one step of which is new. Letting b and c be labels that do not appear elsewhere in the
total scheme, one first replaces $y_0 a a^{-1} y_1$ by the scheme $y_0 a b$ and $b^{-1} a^{-1} y_1$, using the
cutting operation (i). Then one combines the edges labelled a and b in each polygonal
region into a single edge, with a new label. This is the step that is new. The result is
the scheme $y_0 c$ and $c^{-1} y_1$, which one can replace by the single scheme $y_0 y_1$, using
the pasting operation (ii).


Figure 76.3 


(vii) Uncancel. This is the reverse of operation (vi). It replaces the scheme $y_0 y_1$
by the scheme $y_0 a a^{-1} y_1$, where a is a label that does not appear elsewhere in the total
scheme. We shall not actually have occasion to use this operation.


Definition. We define two labelling schemes for collections of polygonal regions 
to be equivalent if one can be obtained from the other by a sequence of elementary 
scheme operations. Since each elementary operation has as its inverse another such 
operation, this is an equivalence relation. 


EXAMPLE 1. The Klein bottle K is the space obtained from the labelling scheme
$aba^{-1}b$. In the exercises of §74, you were asked to show that K is homeomorphic to the
2-fold projective plane $P^2 \# P^2$. The geometric argument suggested there in fact consists of
the following elementary operations:

$$\begin{aligned}
aba^{-1}b &\longrightarrow abc^{-1} \text{ and } ca^{-1}b && \text{by cutting} \\
&\longrightarrow c^{-1}ab \text{ and } b^{-1}ac^{-1} && \text{by permuting the first and flipping the second} \\
&\longrightarrow c^{-1}aac^{-1} && \text{by pasting} \\
&\longrightarrow aacc && \text{by permuting and relabelling.}
\end{aligned}$$
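
The chain of operations in Example 1 can be replayed mechanically. The following Python sketch models operations (i) to (v) on schemes written as lists of (label, exponent) pairs; this encoding, the small `parse` helper (a trailing `-` marks exponent −1), and all function names are assumptions of the sketch rather than notation from the text. The `paste` function accepts the shared label with either sign pattern, which amounts to a sign relabelling followed by operation (ii).

```python
def parse(text):
    """Parse 'a b a- b' into [(label, exponent), ...]; 'a-' means a^-1."""
    return [(t.rstrip("-"), -1 if t.endswith("-") else 1) for t in text.split()]


def cut(w, k, c):
    """(i) Replace w = y0 y1, with len(y0) = k, by y0 c^-1 and c y1."""
    assert all(label != c for label, _ in w), "c must be a new label"
    assert k >= 2 and len(w) - k >= 2, "y0 and y1 need length at least two"
    return w[:k] + [(c, -1)], [(c, 1)] + w[k:]


def paste(w1, w2):
    """(ii) Paste the last edge of w1 to the first edge of w2."""
    (c1, e1), (c2, e2) = w1[-1], w2[0]
    rest = w1[:-1] + w2[1:]
    assert c1 == c2 and e1 == -e2, "edges must carry inverse terms"
    assert all(label != c1 for label, _ in rest), "label must not appear elsewhere"
    return rest


def relabel(w, old, new, sign=1):
    """(iii) Rename `old` to `new`; sign=-1 also reverses its exponents."""
    assert new == old or all(label != new for label, _ in w)
    return [(new, sign * e) if l == old else (l, e) for l, e in w]


def permute(w, k):
    """(iv) Cyclic permutation: y0 y1 becomes y1 y0, with len(y0) = k."""
    return w[k:] + w[:k]


def flip(w):
    """(v) Formal inverse: reverse the order and negate every exponent."""
    return [(label, -exp) for label, exp in reversed(w)]


# Replay Example 1: the Klein bottle scheme a b a^-1 b becomes a a c c.
w1, w2 = cut(parse("a b a- b"), 2, "c")   # a b c^-1  and  c a^-1 b
w1, w2 = permute(w1, 2), flip(w2)         # c^-1 a b  and  b^-1 a c^-1
w = paste(w1, w2)                         # c^-1 a a c^-1
w = relabel(permute(w, 1), "c", "c", -1)  # a a c c
print(w == parse("a a c c"))
```

In a collection of several regions, `relabel` would have to be applied to every scheme at once; here only one region remains when it is used.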


Exercises 


1. Consider the quotient space X obtained from two polygonal regions by means of
the labelling scheme $w_1 = acbc^{-1}$ and $w_2 = cdba^{-1}d$.

(a) If one pastes these regions together along the edges labelled “a,” one can
represent X as the quotient space of a single 7-sided region P. What is a
labelling scheme for P? What sequence of elementary operations is involved
in obtaining this scheme?

(b) Repeat (a), pasting along the edges labelled “b.”

(c) Explain why one cannot paste along the edges labelled “c” to obtain the
scheme $acbdba^{-1}d$ as a way of representing X.


2. Consider the space X obtained from two polygonal regions by means of the
labelling scheme $w_1 = abcc$ and $w_2 = c^{-1}c^{-1}ab$. The following sequence of
elementary operations:

$$\begin{aligned}
abcc \text{ and } c^{-1}c^{-1}ab &\longrightarrow ccab \text{ and } b^{-1}a^{-1}cc && \text{by permuting and flipping} \\
&\longrightarrow ccaa^{-1}cc && \text{by pasting} \\
&\longrightarrow cccc && \text{by cancelling}
\end{aligned}$$

indicates that X is homeomorphic to the four-fold dunce cap. The sequence of
operations

$$\begin{aligned}
abcc \text{ and } c^{-1}c^{-1}ab &\longrightarrow abcc^{-1}ab && \text{by pasting} \\
&\longrightarrow abab && \text{by cancelling}
\end{aligned}$$

indicates that X is homeomorphic to $P^2$. But these two spaces are not homeomorphic.
Which (if either) argument is correct?


§77 The Classification Theorem 


We prove in this section the geometric part of our classification theorem for surfaces.
We show that every space obtained by pasting the edges of a polygonal region together
in pairs is homeomorphic either to $S^2$, to the n-fold torus $T_n$, or to the m-fold projective
plane $P_m$. Later we discuss the problem of showing that every compact surface can be
obtained in this way.

Suppose $w_1, \ldots, w_k$ is a labelling scheme for the polygonal regions $P_1, \ldots, P_k$.
If each label appears exactly twice in this scheme, we call it a proper labelling scheme.
Note the following important fact:

If one applies any elementary operation to a proper scheme, one obtains another
proper scheme.


Definition. Let w be a proper labelling scheme for a single polygonal region. We
say that w is of torus type if each label in it appears once with exponent +1 and once
with exponent −1. Otherwise, we say w is of projective type.
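
Both definitions are easy to test by counting. In the Python sketch below, a scheme is a list of (label, exponent) pairs; that encoding and the function names are assumptions of the sketch. A proper scheme is of torus type exactly when the two exponents of every label sum to zero.

```python
from collections import Counter


def is_proper(schemes):
    """True if each label appears exactly twice in the whole collection."""
    counts = Counter(label for w in schemes for label, _ in w)
    return all(c == 2 for c in counts.values())


def scheme_type(w):
    """Return 'torus' or 'projective' for a proper scheme of one region."""
    if not is_proper([w]):
        raise ValueError("scheme is not proper")
    total = {}
    for label, exp in w:
        total[label] = total.get(label, 0) + exp
    # Exponents +1 and -1 sum to 0; two equal exponents sum to +2 or -2.
    return "torus" if all(s == 0 for s in total.values()) else "projective"


torus = [("a", 1), ("b", 1), ("a", -1), ("b", -1)]
klein_bottle = [("a", 1), ("b", 1), ("a", -1), ("b", 1)]
print(scheme_type(torus), scheme_type(klein_bottle))
```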


We begin by considering a scheme w of projective type. We will show that w
is equivalent to a scheme (of the same length) in which all labels having the same
exponent are paired and appear at the beginning of the scheme. That is, w is equivalent
to a scheme of the form

$$(a_1 a_1)(a_2 a_2) \cdots (a_k a_k) w_1,$$

where $w_1$ is of torus type or is empty.

Because w is of projective type, there is at least one label, say a, such that both
occurrences of a in the scheme w have the same exponent. Therefore, we can assume
that w has the form

$$w = y_0 a y_1 a y_2,$$

where some of the $y_i$ may be empty. We shall insert brackets in this expression for
visual convenience, writing it in the form

$$w = [y_0]a[y_1]a[y_2].$$

We have the following result:

Lemma 77.1. Let w be a proper scheme of the form

$$w = [y_0]a[y_1]a[y_2],$$

where some of the $y_i$ may be empty. Then one has the equivalence

$$w \sim aa[y_0 y_1^{-1} y_2],$$

where $y_1^{-1}$ denotes the formal inverse of $y_1$.


Proof. Step 1. We first consider the case where $y_0$ is empty. We show that

$$a[y_1]a[y_2] \sim aa[y_1^{-1} y_2].$$

Figure 77.1

If $y_1$ is empty, this result is immediate, while if $y_2$ is empty, it follows from flipping,
permuting, and relabelling. If neither is empty, we apply the cutting and pasting argument
indicated in Figure 77.1, followed by a relabelling. We leave it to you to write
down the sequence of elementary operations involved.

Step 2. Now we consider the general case. Let $w = [y_0]a[y_1]a[y_2]$, where $y_0$ is
not empty. If both $y_1$ and $y_2$ are empty, the lemma follows by permuting. Otherwise,
we apply the cutting and pasting argument indicated in Figure 77.2 to show that

$$w \sim b[y_2]b[y_1 y_0^{-1}].$$

It follows that

$$\begin{aligned}
w &\sim bb[y_2^{-1} y_1 y_0^{-1}] && \text{by Step 1} \\
&\sim [y_0 y_1^{-1} y_2]b^{-1}b^{-1} && \text{by flipping} \\
&\sim aa[y_0 y_1^{-1} y_2] && \text{by permuting and relabelling.} \qquad ∎
\end{aligned}$$

Figure 77.2 
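
The equivalence of Lemma 77.1 is a purely formal rewriting of the scheme, so its effect is easy to compute. The Python sketch below applies it to a chosen label; the (label, exponent) encoding and the function names are assumptions of the sketch. If both occurrences of the label have exponent −1, the sketch first changes their sign, which is a relabelling.

```python
def inverse(y):
    """Formal inverse of a scheme: reverse the order, negate each exponent."""
    return [(label, -exp) for label, exp in reversed(y)]


def lemma_77_1(w, a):
    """Rewrite w = [y0] a [y1] a [y2] as a a [y0 y1^-1 y2]."""
    positions = [i for i, (label, _) in enumerate(w) if label == a]
    if len(positions) != 2:
        raise ValueError("label must occur exactly twice")
    i, j = positions
    if w[i][1] != w[j][1]:
        raise ValueError("both occurrences must have the same exponent")
    y0, y1, y2 = w[:i], w[i + 1:j], w[j + 1:]
    return [(a, 1), (a, 1)] + y0 + inverse(y1) + y2


# Klein bottle a b a^-1 b, applied to the label b: the result is b b a a.
klein_bottle = [("a", 1), ("b", 1), ("a", -1), ("b", 1)]
print(lemma_77_1(klein_bottle, "b"))
```

Applied to the Klein bottle scheme, the single rewriting already yields a scheme for $P^2 \# P^2$, in agreement with Example 1 of §76.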


Corollary 77.2. If w is a scheme of projective type, then w is equivalent to a scheme
of the same length having the form

$$(a_1 a_1)(a_2 a_2) \cdots (a_k a_k) w_1,$$

where $k \geq 1$ and $w_1$ is either empty or of torus type.

Proof. The scheme w can be written in the form

$$w = [y_0]a[y_1]a[y_2];$$

then the preceding lemma implies that w is equivalent to a scheme of the form $w' =
aaw_1$ that has the same length as w. If $w_1$ is of torus type, we are finished; otherwise,
we can write $w'$ in the form

$$w' = aa[z_0]b[z_1]b[z_2] = [aaz_0]b[z_1]b[z_2].$$

Applying the preceding lemma again, we conclude that $w'$ is equivalent to a scheme $w''$
of the form

$$w'' = bb[aaz_0 z_1^{-1} z_2] = bbaaw_2,$$

where $w''$ has the same length as w. If $w_2$ is of torus type, we are finished; otherwise,
we continue the argument similarly. ∎


It follows from the preceding corollary that if w is a proper labelling scheme for a
polygonal region, then either (1) w is of torus type, or (2) w is equivalent to a scheme
of the form $(a_1 a_1) \cdots (a_k a_k) w_1$, where $w_1$ is of torus type, or (3) w is equivalent to a
scheme of the form $(a_1 a_1) \cdots (a_k a_k)$. In case (3), we are finished, for such a scheme
represents a connected sum of projective planes. So let us consider cases (1) and (2).

At this point, we note that if w is a scheme of length greater than four of the form 
indicated in case (1) or case (2), and if w contains two adjacent terms having the same 
label but opposite exponents, then the cancelling operation may be applied to reduce w 
to a shorter scheme that is also of the form indicated in cases (1), (2), or (3). Therefore, 
we can reduce w either to a scheme of length four, or to a scheme that does not contain 
two such adjacent terms. 

Schemes of length four are easy to deal with, as we shall see later, so let us assume
that w does not contain two adjacent terms having the same label but opposite exponents.
In that case, we show that w is equivalent to a scheme $w'$, of the same length
as w, having the form

$$\begin{aligned}
w' &= aba^{-1}b^{-1}w'' && \text{in case (1), or} \\
w' &= (a_1 a_1) \cdots (a_k a_k)aba^{-1}b^{-1}w'' && \text{in case (2),}
\end{aligned}$$

where $w''$ is of torus type or is empty. This is the substance of the following lemma:


Lemma 77.3. Let w be a proper scheme of the form $w = w_0 w_1$, where $w_1$ is a
scheme of torus type that does not contain two adjacent terms having the same label.
Then w is equivalent to a scheme of the form $w_0 w_2$, where $w_2$ has the same length
as $w_1$ and has the form

$$w_2 = aba^{-1}b^{-1}w_3,$$

where $w_3$ is of torus type or is empty.

Proof. This is the most elaborate proof of this section; three cuttings and pastings are
involved. We show first that, switching labels and exponents if necessary, w can be
written in the form

$$(*) \qquad w = w_0[y_1]a[y_2]b[y_3]a^{-1}[y_4]b^{-1}[y_5],$$

where some of the $y_i$ may be empty.

Among the labels appearing in $w_1$, let a be one whose two occurrences (with
opposite exponents of course) are as close together as possible. These occurrences
are nonadjacent, by hypothesis. Switching exponents if necessary, we can assume that
the term a occurs first and the term $a^{-1}$ occurs second. Let b be any label appearing
between a and $a^{-1}$; we can assume its exponent is +1. Now the term $b^{-1}$ appears
in $w_1$, but cannot occur between a and $a^{-1}$ because these two are as close together as
possible. If $b^{-1}$ appears following $a^{-1}$, we are finished. If it appears preceding a, then
all we need to do is to switch exponents on the b terms, and then switch the labels a
and b, to obtain a scheme of the desired form.

So let us assume that w has the form (*).

First cutting and pasting. We show that w is equivalent to the scheme

$$w' = w_0 a[y_2]b[y_3]a^{-1}[y_1 y_4]b^{-1}[y_5].$$

To prove this result, we rewrite w in the form

$$w = w_0[y_1]a[y_2 b y_3]a^{-1}[y_4 b^{-1} y_5].$$

We then apply the cutting and pasting argument indicated in Figure 77.3 to conclude
that

$$\begin{aligned}
w &\sim w_0 c[y_2 b y_3]c^{-1}[y_1 y_4 b^{-1} y_5] \\
&\sim w_0 a[y_2]b[y_3]a^{-1}[y_1 y_4]b^{-1}[y_5],
\end{aligned}$$

by relabelling. Note that the cut at c can be made because both the resulting polygons
have at least three sides.


Figure 77.3 


Second cutting and pasting. Given

$$w' = w_0 a[y_2]b[y_3]a^{-1}[y_1 y_4]b^{-1}[y_5],$$

we show that $w'$ is equivalent to the scheme

$$w'' = w_0 a[y_1 y_4 y_3]ba^{-1}b^{-1}[y_2 y_5].$$

If all the schemes $y_1$, $y_4$, $y_5$, and $w_0$ are empty, then the argument is easy, since in
that case

$$\begin{aligned}
w' &= a[y_2]b[y_3]a^{-1}b^{-1} \\
&\sim b[y_3]a^{-1}b^{-1}a[y_2] && \text{by permuting} \\
&\sim a[y_3]ba^{-1}b^{-1}[y_2] && \text{by relabelling} \\
&= w''.
\end{aligned}$$

Figure 77.4

Otherwise, we apply the argument indicated in Figure 77.4 to conclude that

$$\begin{aligned}
w' &= w_0 a[y_2]b[y_3]a^{-1}[y_1 y_4]b^{-1}[y_5] \\
&\sim w_0 c[y_1 y_4 y_3]a^{-1}c^{-1}a[y_2 y_5] \\
&\sim w_0 a[y_1 y_4 y_3]ba^{-1}b^{-1}[y_2 y_5],
\end{aligned}$$

by relabelling.
Third cutting and pasting. We complete the proof. Given

$$w'' = w_0 a[y_1 y_4 y_3]ba^{-1}b^{-1}[y_2 y_5],$$

we show that $w''$ is equivalent to the scheme

$$w''' = w_0 aba^{-1}b^{-1}[y_1 y_4 y_3 y_2 y_5].$$

If the schemes $w_0$, $y_5$, and $y_2$ are empty, the argument is easy, since in that case

$$\begin{aligned}
w'' &= a[y_1 y_4 y_3]ba^{-1}b^{-1} \\
&\sim ba^{-1}b^{-1}a[y_1 y_4 y_3] && \text{by permuting} \\
&\sim aba^{-1}b^{-1}[y_1 y_4 y_3] && \text{by relabelling} \\
&= w'''.
\end{aligned}$$

Otherwise, we apply the argument indicated in Figure 77.5 to conclude that

$$\begin{aligned}
w'' &= w_0 a[y_1 y_4 y_3]ba^{-1}b^{-1}[y_2 y_5] \\
&\sim w_0 ca^{-1}c^{-1}a[y_1 y_4 y_3 y_2 y_5] \\
&\sim w_0 aba^{-1}b^{-1}[y_1 y_4 y_3 y_2 y_5],
\end{aligned}$$

by relabelling, as desired. ∎

Figure 77.5

The final step of our classification procedure involves showing that a connected
sum of projective planes and tori is equivalent to a connected sum of projective planes
alone.


Lemma 77.4. Let w be a proper scheme of the form

$$w = w_0(cc)(aba^{-1}b^{-1})w_1.$$

Then w is equivalent to the scheme

$$w' = w_0(aabbcc)w_1.$$

Proof. Recall Lemma 77.1, which states that for proper schemes we have

$$(*) \qquad [y_0]a[y_1]a[y_2] \sim aa[y_0 y_1^{-1} y_2].$$

We proceed as follows:

$$\begin{aligned}
w &\sim (cc)(aba^{-1}b^{-1})w_1 w_0 && \text{by permuting} \\
&= cc[ab][ba]^{-1}[w_1 w_0] \\
&\sim [ab]c[ba]c[w_1 w_0] && \text{by (*) read backwards} \\
&= [a]b[c]b[acw_1 w_0] \\
&\sim bb[ac^{-1}acw_1 w_0] && \text{by (*)} \\
&= [bb]a[c]^{-1}a[cw_1 w_0] \\
&\sim aa[bbccw_1 w_0] && \text{by (*)} \\
&\sim w_0 aabbccw_1 && \text{by permuting.} \qquad ∎
\end{aligned}$$


Theorem 77.5 (The classification theorem). Let X be the quotient space obtained
from a polygonal region in the plane by pasting its edges together in pairs. Then X is
homeomorphic either to $S^2$, to the n-fold torus $T_n$, or to the m-fold projective plane $P_m$.


Proof. Let w be the labelling scheme by which one forms the space X from the
polygonal region P. Then w is a proper scheme of length at least 4. We show that w is
equivalent to one of the following schemes:
(1) $aa^{-1}bb^{-1}$,
(2) $abab$,
(3) $(a_1 a_1)(a_2 a_2) \cdots (a_m a_m)$ with $m \geq 2$,
(4) $(a_1 b_1 a_1^{-1} b_1^{-1})(a_2 b_2 a_2^{-1} b_2^{-1}) \cdots (a_n b_n a_n^{-1} b_n^{-1})$ with $n \geq 1$.
The first scheme gives rise to the space $S^2$, and the second, to the space $P^2$, as we
noted in Examples 2 and 4 of §74. The third leads to the space $P_m$ and the fourth to
the space $T_n$.
Step 1. Let w be a proper scheme of torus type. We show that w is equivalent
either to scheme (1) or to a scheme of type (4).
If w has length four, then it can be written in one of the forms

$$aa^{-1}bb^{-1} \quad \text{or} \quad aba^{-1}b^{-1}.$$

The first is of type (1) and the second of type (4).

We proceed by induction on the length of w. Assume w has length greater than
four. If w is equivalent to a shorter scheme of torus type, then the induction hypothesis
applies. Otherwise, we know that w contains no pair of adjacent terms having the
same label. We apply Lemma 77.3 (with $w_0$ empty) to conclude that w is equivalent
to a scheme having the same length as w, of the form

$$aba^{-1}b^{-1}w_3,$$

where $w_3$ is of torus type. Note that $w_3$ is not empty because w has length greater
than four. Again, $w_3$ cannot contain two adjacent terms having the same label, since
w is not equivalent to a shorter scheme of torus type. Applying the lemma again, with
$w_0 = aba^{-1}b^{-1}$, we conclude that w is equivalent to a scheme of the form

$$(aba^{-1}b^{-1})(cdc^{-1}d^{-1})w_4,$$

where $w_4$ is empty or of torus type. If $w_4$ is empty, we are finished; otherwise we
apply the lemma again. Continue similarly.

Step 2. Now let w be a proper scheme of projective type. We show that w is 
equivalent either to scheme (2) or to a scheme of type (3). 

If w has length four, Corollary 77.2 implies that w is equivalent to one of the
schemes $aabb$ or $aab^{-1}b$. The first is of type (3). The second can be written in the
form $aay_1^{-1}y_2$, with $y_1 = y_2 = b$; then Lemma 77.1 implies that it is equivalent to the
scheme $ay_1ay_2 = abab$, which is of type (2).

We proceed by induction on the length of w. Assume w has length greater than
four. Corollary 77.2 tells us that w is equivalent to a scheme of the form

$$w' = (a_1 a_1) \cdots (a_k a_k)w_1,$$

where $k \geq 1$ and $w_1$ is of torus type or empty. If $w_1$ is empty, we are finished. If $w_1$
has two adjacent terms having the same label, then $w'$ is equivalent to a shorter scheme
of projective type and the induction hypothesis applies. Otherwise, Lemma 77.3 tells
us that $w'$ is equivalent to a scheme of the form

$$w'' = (a_1 a_1) \cdots (a_k a_k)aba^{-1}b^{-1}w_2,$$

where $w_2$ is either empty or of torus type. Then we apply Lemma 77.4 to conclude
that $w''$ is equivalent to the scheme

$$(a_1 a_1) \cdots (a_k a_k)aabbw_2.$$

We continue similarly. Eventually we reach a scheme of type (3). ∎
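
The proof is constructive, and its bookkeeping can be followed by a short program. The Python sketch below is an illustration under stated assumptions, not a transcription of the proof. It encodes a scheme as a list of (label, exponent) pairs, with a hypothetical `parse` helper in which a trailing `-` marks exponent −1. It records only the number k of pairs $(a_i a_i)$ produced by Lemma 77.1 and the number g of commutators split off by Lemma 77.3, rather than the full chain of elementary operations. It also cancels adjacent terms $aa^{-1}$ even below length four, which operation (vi) does not permit; this shortcut agrees with the length-four cases treated separately in the proof, where $aa^{-1}bb^{-1}$ gives $S^2$ and $aab^{-1}b$ gives $P^2$.

```python
from collections import Counter


def parse(text):
    """Parse 'a b a- b-' into [(label, exponent), ...]; 'a-' means a^-1."""
    return [(t.rstrip("-"), -1 if t.endswith("-") else 1) for t in text.split()]


def inverse(y):
    """Formal inverse: reverse the order and negate each exponent."""
    return [(label, -exp) for label, exp in reversed(y)]


def classify(w):
    """Return ('S2', 0), ('T', n) or ('P', m) for a proper scheme w."""
    w = list(w)
    if any(c != 2 for c in Counter(label for label, _ in w).values()):
        raise ValueError("scheme is not proper")

    # Corollary 77.2: apply Lemma 77.1 until k pairs (a a) stand in front.
    k = 0
    while True:
        seen, hit = {}, None
        for i in range(2 * k, len(w)):
            label, exp = w[i]
            if label in seen and w[seen[label]][1] == exp:
                hit = (seen[label], i)
                break
            seen[label] = i
        if hit is None:
            break
        i, j = hit
        a = (w[i][0], 1)
        w = [a, a] + w[:i] + inverse(w[i + 1:j]) + w[j + 1:]
        k += 1

    # The tail is now of torus type; count the commutators g split off it.
    tail, g = w[2 * k:], 0
    while tail:
        for i in range(len(tail) - 1):
            if tail[i][0] == tail[i + 1][0]:   # adjacent a a^-1: cancel
                tail = tail[:i] + tail[i + 2:]
                break
        else:
            pos = {}
            for i, (label, _) in enumerate(tail):
                pos.setdefault(label, []).append(i)
            a = min(pos, key=lambda l: pos[l][1] - pos[l][0])
            b = tail[pos[a][0] + 1][0]
            i1, i2, i3, i4 = sorted(pos[a] + pos[b])
            # Lemma 77.3: [y1]a[y2]b[y3]a^-1[y4]b^-1[y5] ~ aba^-1b^-1[y1 y4 y3 y2 y5]
            tail = (tail[:i1] + tail[i3 + 1:i4] + tail[i2 + 1:i3]
                    + tail[i1 + 1:i2] + tail[i4 + 1:])
            g += 1

    if k == 0:
        return ("S2", 0) if g == 0 else ("T", g)
    return ("P", k + 2 * g)  # Lemma 77.4: each commutator becomes two pairs


print(classify(parse("a b a- b-")))   # torus
print(classify(parse("a b a- b")))    # Klein bottle, i.e. 2-fold projective plane
```

The return value names the standard form: `('T', n)` for the n-fold torus and `('P', m)` for the m-fold projective plane.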


Exercises 


1. Let X be a space obtained by pasting the edges of a polygonal region together in 
pairs. 
(a) Show that X is homeomorphic to exactly one of the spaces in the following 
list: $S^2$, $P^2$, $K$, $T_n$, $T_n \# P^2$, $T_n \# K$, where K is the Klein bottle and $n \geq 1$. 
(b) Show that X is homeomorphic to exactly one of the spaces in the following 
list: $S^2$, $T_n$, $P^2$, $K_m$, $P^2 \# K_m$, where $K_m$ is the m-fold connected sum of K 
with itself and $m \geq 1$. 
2. (a) Write down the sequence of elementary operations required to carry out the 
arguments indicated in Figures 77.1 and 77.2. 
(b) Write down the sequence of elementary operations required to carry out the 
arguments indicated in Figures 77.3, 77.4, and 77.5. 
3. The proof of the classification theorem provides an algorithm for taking a proper 
labelling scheme for a polygonal region and reducing it to one of the four standard 
forms indicated in the theorem. The appropriate equivalences are the following: 

(i) $[y_0]a[y_1]a[y_2] \sim aa[y_0y_1^{-1}y_2]$. 
(ii) $[y_0]aa^{-1}[y_1] \sim [y_0y_1]$ if $y_0y_1$ has length at least 4. 
(iii) $w_0[y_1]a[y_2]b[y_3]a^{-1}[y_4]b^{-1}[y_5] \sim w_0aba^{-1}b^{-1}[y_1y_4y_3y_2y_5]$. 
(iv) $w_0(cc)(aba^{-1}b^{-1})w_1 \sim w_0aabbccw_1$. 

Using this algorithm, reduce each of the following schemes to one of the standard 
forms. 
(a) $abacb^{-1}c^{-1}$. 
(b) $abca^{-1}cb$. 
(c) $abbca^{-1}ddc^{-1}$. 
(d) $abcda^{-1}b^{-1}c^{-1}d^{-1}$. 
(e) $abcda^{-1}c^{-1}b^{-1}d^{-1}$. 
(f) $aabcdc^{-1}b^{-1}d^{-1}$. 
(g) $abcdabdc$. 
(h) $abcdabcd$. 


4. Let w be a proper labelling scheme for a 10-sided polygonal region. If w is of 
projective type, which of the list of spaces in Theorem 77.5 can it represent? 
What if w is of torus type? 


§78 Constructing Compact Surfaces 


To complete our classification of the compact surfaces, we must show that every com- 
pact connected surface can be obtained by pasting together in pairs the edges of a 
polygonal region. We shall actually prove something slightly weaker than this, for we 
shall assume that the surface in question has what is called a triangulation. We define 
this notion as follows: 


Definition. Let X be a compact Hausdorff space. A curved triangle in X is a subspace 
A of X and a homeomorphism $h : T \to A$, where T is a closed triangular 
region in the plane. If e is an edge of T, then $h(e)$ is said to be an edge of A; if 
v is a vertex of T, then $h(v)$ is said to be a vertex of A. A triangulation of X is a 
collection of curved triangles $A_1, \ldots, A_n$ in X whose union is X such that for $i \neq j$, 
the intersection $A_i \cap A_j$ is either empty, or a vertex of both $A_i$ and $A_j$, or an edge of 
both. Furthermore, if $h_i : T_i \to A_i$ is the homeomorphism associated with $A_i$, we 
require that when $A_i \cap A_j$ is an edge e of both, then the map $h_j^{-1}h_i$ defines a linear 
homeomorphism of the edge $h_i^{-1}(e)$ of $T_i$ with the edge $h_j^{-1}(e)$ of $T_j$. If X has a 
triangulation, it is said to be triangulable. 
It is a basic theorem that every compact surface is triangulable. The proof is long 
but not exceedingly difficult. (See [A-S] or [D-M].) 


Theorem 78.1. If X is a compact triangulable surface, then X is homeomorphic to 
the quotient space obtained from a collection of disjoint triangular regions in the plane 
by pasting their edges together in pairs. 


Proof. Let $A_1, \ldots, A_n$ be a triangulation of X, with corresponding homeomorphisms 
$h_i : T_i \to A_i$. We assume the triangles $T_i$ are disjoint; then the maps $h_i$ combine to 
define a map $h : E = T_1 \cup \cdots \cup T_n \to X$ that is automatically a quotient map. 
(E is compact and X is Hausdorff.) Furthermore, because the map $h_j^{-1} \circ h_i$ is linear 
whenever $A_i$ and $A_j$ intersect in an edge, h pastes the edges of $T_i$ and $T_j$ together by 
a linear homeomorphism. 

We have two things to prove. First, we must show that for each edge e of a triangle 
$A_i$, there is exactly one other triangle $A_j$ such that $A_i \cap A_j = e$. This will show 
that the quotient map h pastes the edges of the triangles $T_i$ together in pairs. 

The second is a bit less obvious. We must show that if the intersection $A_i \cap A_j$ 
equals a vertex v of each, then there is a sequence of triangles having v as a vertex, 
beginning with $A_i$ and ending with $A_j$, such that the intersection of each triangle of 
the sequence with its successor equals an edge of each. See Figure 78.1. 




Figure 78.1 


If this were not the case, one might have a situation such as that pictured in Fig- 
ure 78.2. Here, one cannot specify the quotient map h merely by specifying how the 
edges of the triangles $T_i$ are to be pasted together, but one must also indicate how the 
vertices are to be identified when that identification is not forced by the pasting of 
edges. 

Step 1. Let us tackle the second problem first. We show that because the space X 
is a surface, a situation such as that indicated in Figure 78.2 cannot occur. 

Given v, let us define two triangles $A_i$ and $A_j$ having v as a vertex to be equivalent 
if there is a sequence of triangles having v as a vertex, beginning with $A_i$ and ending 
with $A_j$, such that the intersection of each triangle with its successor is an edge of each. 
If there is more than one equivalence class, let B be the union of the triangles in one 
class and let C be the union of the others. The sets B and C intersect in v alone because 
no triangle in B has an edge in common with a triangle in C. We conclude that for 
every sufficiently small neighborhood W of v in X, the space $W - v$ is not connected. 


Figure 78.2 

On the other hand, if X is a surface, then v has a neighborhood homeomorphic to 
an open 2-ball. In this case, v has arbitrarily small neighborhoods W such that W — v 
is connected. 


Step 2. Now we tackle the first question. This is a bit more work. First, we show 
that, given an edge e of the triangle $A_i$, there is at least one additional triangle $A_j$ 
having e as an edge. This is a consequence of the following result: 

If X is a triangular region in the plane and if x is a point interior to one of the 
edges of X, then x does not have a neighborhood in X homeomorphic to an open 
2-ball. 

To prove this fact, we note that x has arbitrarily small neighborhoods W for which 
$W - x$ is simply connected. Indeed, if W is the ε-neighborhood of x in X, for ε small, 
then it is easy to see that $W - x$ is contractible to a point. See Figure 78.3. 




Figure 78.3 


On the other hand, suppose there is a neighborhood U of x that is homeomorphic 
to an open ball in $\mathbb{R}^2$, with the homeomorphism carrying x to 0. We show that x does 
not have arbitrarily small neighborhoods W such that $W - x$ is simply connected. 

Indeed, let B be the open unit ball in $\mathbb{R}^2$ centered at the origin, and suppose V is 
any neighborhood of 0 that is contained in B. Choose ε so that the open ball $B_\varepsilon$ of 
radius ε centered at 0 lies in V, and consider the inclusion mappings 

$B_\varepsilon - 0 \longrightarrow V - 0 \xrightarrow{\;k\;} B - 0$, with composite $i : B_\varepsilon - 0 \to B - 0$. 

The inclusion i is homotopic to the homeomorphism $h(x) = x/\varepsilon$, so it induces an 
isomorphism of fundamental groups. Therefore, $k_*$ is surjective; it follows that $V - 0$ 
cannot be simply connected. See Figure 78.4. 


Figure 78.4 


Step 3. Now we show that given an edge e of the triangle $A_i$, there is no more than 
one additional triangle $A_j$ having e as an edge. This is a consequence of the following 
result: 

Let X be the union of k triangles in $\mathbb{R}^3$, each pair of which intersect in the common 
edge e. Let x be an interior point of e. If $k \geq 3$, then x does not have a neighborhood 
in X homeomorphic to an open 2-ball. 

We show that there is no neighborhood W of x in X such that W — x has abelian 
fundamental group. It follows that no neighborhood of x is homeomorphic to an open 
2-ball. 

To begin, we show that if A is the union of all the edges of the triangles of X that 
are different from e, then the fundamental group of A is not abelian. The space A is 
the union of a collection of k arcs, each pair of which intersect in their end points. If 
B is the union of three of the arcs that make up A, then there is a retraction r of A 
onto B, obtained by mapping each of the arcs not in B homeomorphically onto one 
of the arcs in B, keeping the end points fixed. Then $r_*$ is an epimorphism. Since the 
fundamental group of B is not abelian (by Example 1 of §70 or Example 3 of §58), 
neither is the fundamental group of A. 

It follows that the fundamental group of X — x is not abelian, for it is easy to see 
that A is a deformation retract of X — x. See Figure 78.5. 

Figure 78.5 


Now we prove our result. For convenience, assume x is the origin in $\mathbb{R}^3$. If W is an 
arbitrary neighborhood of 0, we can find a “shrinking map” $f(x) = \varepsilon x$ that carries X 
into W. The space $X_\varepsilon = f(X)$ is a copy of X lying inside W. Consider the inclusions 

$X_\varepsilon - 0 \longrightarrow W - 0 \xrightarrow{\;k\;} X - 0$, with composite $i : X_\varepsilon - 0 \to X - 0$. 

The inclusion i is homotopic to the homeomorphism $h(x) = x/\varepsilon$, so it induces an 
isomorphism of fundamental groups. It follows that $k_*$ is surjective, so the fundamental 
group of $W - 0$ cannot be abelian. ■ 


Theorem 78.2. If X is a compact connected triangulable surface, then X is homeo- 
morphic to a space obtained from a polygonal region in the plane by pasting the edges 
together in pairs. 


Proof. It follows from the preceding theorem that there is a collection $T_1, \ldots, T_n$ of 
triangular regions in the plane, and orientations and a labelling of the edges of these 
regions, where each label appears exactly twice in the total labelling scheme, such that 
X is homeomorphic to the quotient space obtained from these regions by means of this 
labelling scheme. 

We apply the pasting operation of §76. If two triangular regions have edges bear- 
ing the same label, we can (after flipping one of the regions if necessary) paste the 
regions together along these two edges. The result is to replace the two triangular re- 
gions by a single four-sided polygonal region, whose edges still bear orientations and 
labels. We continue similarly. As long as we have two regions having edges bearing 
the same label, the process can be continued. 

Eventually one reaches the situation where either one has a single polygonal re- 
gion, in which case the theorem is proved, or one has several polygonal regions, no 
two of which have edges bearing the same label. In such a case, the space formed by 
carrying out the indicated pasting of edges is not connected; in fact, each of the regions 
gives rise to a component of this space. Since the space X is connected, this situation 
cannot occur. ■ 
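
The stopping condition in this proof can be checked mechanically. The following is a minimal sketch, not part of the original argument. It assumes each region's labelling scheme is given as a string of single lowercase letters, with `^-1` marking a reversed edge; the function name `count_components` is hypothetical. Two regions are joined whenever they share a label, and the resulting classes are counted with a union-find structure.

```python
import re


def count_components(regions):
    """Count the classes of regions linked by shared edge labels.

    `regions` is a list of strings such as "abc" or "dfe^-1" (assumed encoding).
    Exponents are ignored: only the fact that two regions share a label matters.
    """
    parent = list(range(len(regions)))

    def find(i):
        # Follow parent pointers to the class representative, halving the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    first_seen = {}  # label -> index of the first region that carries it
    for index, word in enumerate(regions):
        for label in re.findall(r"[a-z]", word):
            if label in first_seen:
                # Same label on two regions: they can be pasted together.
                parent[find(index)] = find(first_seen[label])
            else:
                first_seen[label] = index
    return len({find(i) for i in range(len(regions))})
```

A result of 1 means the pasting process ends with a single polygonal region; a larger result is exactly the situation the proof rules out for a connected X.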


Exercises 


1. What space is indicated by each of the following labelling schemes for a collec- 
tion of four triangular regions? 
(a) abc, dae, bef, cdf. 
(b) $abc$, $cba$, $def$, $dfe^{-1}$. 
2. Let $H^2$ be the subspace of $\mathbb{R}^2$ consisting of all points $(x_1, x_2)$ with $x_2 \geq 0$. A 2-manifold 
with boundary (or surface with boundary) is a Hausdorff space X with 
a countable basis such that each point x of X has a neighborhood homeomorphic 
with an open set of $\mathbb{R}^2$ or $H^2$. The boundary of X (denoted $\partial X$) consists of 
those points x such that x has no neighborhood homeomorphic with an open set 
of $\mathbb{R}^2$. 
(a) Show that no point of $H^2$ of the form $(x_1, 0)$ has a neighborhood (in $H^2$) 
that is homeomorphic to an open set of $\mathbb{R}^2$. 
(b) Show that $x \in \partial X$ if and only if there is a homeomorphism h mapping a 
neighborhood of x onto an open set of $H^2$ such that $h(x) \in \mathbb{R} \times 0$. 
(c) Show that $\partial X$ is a 1-manifold. 


3. Show that the closed unit ball in $\mathbb{R}^2$ is a 2-manifold with boundary. 


4. Let X be a 2-manifold; let $U_1, \ldots, U_k$ be a collection of disjoint open sets in 
X; and suppose that for each i, there is a homeomorphism $h_i$ of the open unit 
ball $B^2$ with $U_i$. Let $\varepsilon = 1/2$ and let $B_\varepsilon$ be the open ball of radius ε. Show that 
the space $Y = X - \bigcup h_i(B_\varepsilon)$ is a 2-manifold with boundary, and that $\partial Y$ has 
k components. The space Y is called “X-with-k-holes.” 


5. Prove the following: 
Theorem. Given a compact connected triangulable 2-manifold Y with boundary, 
such that $\partial Y$ has k components, then Y is homeomorphic to X-with-k-holes, 
where X is either $S^2$ or the n-fold torus $T_n$ or the m-fold projective plane $P_m$. 
[Hint: Each component of $\partial Y$ is homeomorphic to a circle.] 


Chapter 13 


Classification of Covering Spaces 


Up to this point, we have used covering spaces primarily as a tool for computing 
fundamental groups. Now we turn things around and use the fundamental group as a 
tool for studying covering spaces. 

To do this in any reasonable way, we must restrict ourselves to the case where B 
is locally path connected. Once we have done this, we may as well require B to be 
path connected as well, since B breaks up into the disjoint open sets $B_\alpha$ that are its 
path components, and the maps $p^{-1}(B_\alpha) \to B_\alpha$ obtained by restricting p are covering 
maps, by Theorem 53.2. We may as well assume also that E is path connected. For if 
$E_\alpha$ is a path component of $p^{-1}(B_\alpha)$, then the map $E_\alpha \to B_\alpha$ obtained by restricting p 
is also a covering map. (See Lemma 80.1.) Therefore, one can determine all cover- 
ings of the locally path-connected space B merely by determining all path-connected 
coverings of each path component of B! 

For this reason, we make the following: 


Convention. Throughout this chapter, the statement that $p : E \to B$ is a covering 
map will include the assumption that E and B are locally path connected and path 
connected, unless specifically stated otherwise. 


With this convention, we now describe the connection between covering spaces 
of B and the fundamental group of B. 
If $p : E \to B$ is a covering map, with $p(e_0) = b_0$, then the induced homomorphism 
$p_*$ is injective, by Theorem 54.6, so that 


$H_0 = p_*(\pi_1(E, e_0))$ 


is a subgroup of $\pi_1(B, b_0)$ isomorphic to $\pi_1(E, e_0)$. It turns out that the subgroup $H_0$ 
determines the covering p completely, up to a suitable notion of equivalence of coverings. 
This we shall prove in §79. Furthermore, under a (fairly mild) additional “local 
niceness” condition on B, there exists, for each subgroup $H_0$ of $\pi_1(B, b_0)$, a covering 
$p : E \to B$ of B whose corresponding subgroup is $H_0$. This we shall prove in §82. 

Roughly speaking, these results show that one can determine all covering spaces 
of B merely by examining the collection of all subgroups of $\pi_1(B, b_0)$. This is the 
classical procedure of algebraic topology; one “solves” a problem of topology by re- 
ducing it to a problem of algebra, hopefully one that is more tractable. 

Throughout the chapter, we assume the general lifting correspondence theorem, 
Theorem 54.6. 


§79 Equivalence of Covering Spaces 


In this section, we show that the subgroup $H_0$ of $\pi_1(B, b_0)$ determines the covering 
$p : E \to B$ completely, up to a suitable notion of equivalence of coverings. 


Definition. Let $p : E \to B$ and $p' : E' \to B$ be covering maps. They are said to 
be equivalent if there exists a homeomorphism $h : E \to E'$ such that $p = p' \circ h$. 
The homeomorphism h is called an equivalence of covering maps or an equivalence 
of covering spaces. 


Given two covering maps $p : E \to B$ and $p' : E' \to B$ whose corresponding 
subgroups $H_0$ and $H_0'$ are equal, we shall prove that there exists an equivalence 
$h : E \to E'$. For this purpose, we need to generalize the lifting lemmas of §54. 


Lemma 79.1 (The general lifting lemma). Let $p : E \to B$ be a covering map; 
let $p(e_0) = b_0$. Let $f : Y \to B$ be a continuous map, with $f(y_0) = b_0$. Suppose 
Y is path connected and locally path connected. The map f can be lifted to a map 
$\tilde f : Y \to E$ such that $\tilde f(y_0) = e_0$ if and only if 


$f_*(\pi_1(Y, y_0)) \subset p_*(\pi_1(E, e_0)).$ 


Furthermore, if such a lifting exists, it is unique. 




Proof. If the lifting $\tilde f$ exists, then 


$f_*(\pi_1(Y, y_0)) = p_*(\tilde f_*(\pi_1(Y, y_0))) \subset p_*(\pi_1(E, e_0)).$ 


This proves the “only if” part of the theorem. 

Now we prove that if $\tilde f$ exists, it is unique. Given $y_1 \in Y$, choose a path α in Y 
from $y_0$ to $y_1$. Take the path $f \circ \alpha$ in B and lift it to a path γ in E beginning at $e_0$. If 
a lifting $\tilde f$ of f exists, then $\tilde f(y_1)$ must equal the end point $\gamma(1)$ of γ, for $\tilde f \circ \alpha$ is a 
lifting of $f \circ \alpha$ that begins at $e_0$, and path liftings are unique. 

Finally, we prove the “if” part of the theorem. The uniqueness part of the proof 
gives us a clue how to proceed. Given $y_1 \in Y$, choose a path α in Y from $y_0$ to $y_1$. 
Lift the path $f \circ \alpha$ to a path γ in E beginning at $e_0$, and define $\tilde f(y_1) = \gamma(1)$. See 
Figure 79.1. It is a certain amount of work to show that $\tilde f$ is well defined, independent 
of the choice of α. Once we prove that, continuity of $\tilde f$ is proved easily, as we now 
show. 


Figure 79.1 


To prove continuity of $\tilde f$ at the point $y_1$ of Y, we show that, given a neighborhood 
N of $\tilde f(y_1)$, there is a neighborhood W of $y_1$ such that $\tilde f(W) \subset N$. To begin, 
choose a path-connected neighborhood U of $f(y_1)$ that is evenly covered by p. 
Break $p^{-1}(U)$ up into slices, and let $V_0$ be the slice that contains the point $\tilde f(y_1)$. 
Replacing U by a smaller neighborhood of $f(y_1)$ if necessary, we can assume that 
$V_0 \subset N$. Let $p_0 : V_0 \to U$ be obtained by restricting p; then $p_0$ is a homeomorphism. 
Because f is continuous at $y_1$ and Y is locally path connected, we can find 
a path-connected neighborhood W of $y_1$ such that $f(W) \subset U$. We shall show that 
$\tilde f(W) \subset V_0$; then our result is proved. 

Given $y \in W$, choose a path β in W from $y_1$ to y. Since $\tilde f$ is well defined, $\tilde f(y)$ 
can be obtained by taking the path $\alpha * \beta$ from $y_0$ to y, lifting the path $f \circ (\alpha * \beta)$ to a 
path in E beginning at $e_0$, and letting $\tilde f(y)$ be the end point of this lifted path. Now γ 
is a lifting of $f \circ \alpha$ that begins at $e_0$. Since the path $f \circ \beta$ lies in U, the path $\delta = p_0^{-1} \circ f \circ \beta$ 
is a lifting of it that begins at $\tilde f(y_1)$. Then $\gamma * \delta$ is a lifting of $f \circ (\alpha * \beta)$ that begins 
at $e_0$; it ends at the point $\delta(1)$ of $V_0$. Hence $\tilde f(W) \subset V_0$, as desired. 

Finally, we show $\tilde f$ is well defined. Let α and β be two paths in Y from $y_0$ to $y_1$. 
We must show that if we lift $f \circ \alpha$ and $f \circ \beta$ to paths in E beginning at $e_0$, then these 
lifted paths end at the same point of E. 

First, we lift $f \circ \alpha$ to a path γ in E beginning at $e_0$; then we lift $f \circ \bar\beta$ to a path δ 
in E beginning at the end point $\gamma(1)$ of γ. Then $\gamma * \delta$ is a lifting of the loop $f \circ (\alpha * \bar\beta)$. 
Now by hypothesis, 


$f_*(\pi_1(Y, y_0)) \subset p_*(\pi_1(E, e_0)).$ 


Hence $[f \circ (\alpha * \bar\beta)]$ belongs to the image of $p_*$. Theorem 54.6 now implies that its lift 
$\gamma * \delta$ is a loop in E. 

It follows that $\tilde f$ is well defined. For $\bar\delta$ is a lifting of $f \circ \beta$ that begins at $e_0$, and γ 
is a lifting of $f \circ \alpha$ that begins at $e_0$, and both liftings end at the same point of E. ■ 
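
As an illustration of the lifting criterion (a sketch under assumed conventions, not an example taken from the text), take $Y = E = B = S^1$ with base points $y_0 = e_0 = b_0 = 1$, let $p(z) = z^n$ and $f(z) = z^m$ for positive integers m and n, and identify $\pi_1(S^1, 1)$ with $\mathbb{Z}$ in the standard way.

```latex
\[
f_*(\pi_1(S^1,1)) = m\mathbb{Z}, \qquad
p_*(\pi_1(S^1,1)) = n\mathbb{Z}, \qquad
m\mathbb{Z} \subset n\mathbb{Z} \iff n \mid m .
\]
% If m = nk, the lift with \tilde f(1) = 1 is \tilde f(z) = z^k:
\[
p(\tilde f(z)) = (z^k)^n = z^{nk} = z^m = f(z), \qquad \tilde f(1) = 1 .
\]
```

Under these assumptions f lifts through p exactly when n divides m, and the uniqueness statement says that $z \mapsto z^k$ is the only lift carrying 1 to 1.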


Theorem 79.2. Let $p : E \to B$ and $p' : E' \to B$ be covering maps; let $p(e_0) = 
p'(e_0') = b_0$. There is an equivalence $h : E \to E'$ such that $h(e_0) = e_0'$ if and only if 
the groups 


$H_0 = p_*(\pi_1(E, e_0))$ and $H_0' = p'_*(\pi_1(E', e_0'))$ 


are equal. If h exists, it is unique. 

Proof. We prove the “only if” part of the theorem. Given h, the fact that h is a 
homeomorphism implies that 


$h_*(\pi_1(E, e_0)) = \pi_1(E', e_0').$ 


Since $p' \circ h = p$, we have $H_0 = H_0'$. 
Now we prove the “if” part of the theorem; we assume that $H_0 = H_0'$ and show 
that h exists. We shall apply the preceding lemma (four times!). Consider the maps 


$E \xrightarrow{\;p\;} B \xleftarrow{\;p'\;} E'.$ 


Because p' is a covering map and E is path connected and locally path connected, 
and because $H_0 \subset H_0'$, there exists a map $h : E \to E'$ with $h(e_0) = e_0'$ that is a 
lifting of p (that is, such that $p' \circ h = p$). Reversing the roles of E and E' in this 
argument, we see there is a map $k : E' \to E$ with $k(e_0') = e_0$ such that $p \circ k = p'$. 
Now consider the maps 


$E \xrightarrow{\;p\;} B \xleftarrow{\;p\;} E.$ 


The map $k \circ h : E \to E$ is a lifting of p (since $p \circ k \circ h = p' \circ h = p$), with 
$(k \circ h)(e_0) = e_0$. The identity map $i_E$ of E is another such lifting. The uniqueness part 
of the preceding lemma implies that $k \circ h = i_E$. A similar argument shows that $h \circ k$ 
equals the identity map of E'. ■ 


We seem to have solved our equivalence problem. But there is a somewhat subtle 
point we have overlooked. We have obtained a necessary and sufficient condition for 
there to exist an equivalence $h : E \to E'$ that carries the point $e_0$ to the point $e_0'$. 
But we have not yet determined under what conditions there exists an equivalence in 
general. It is possible that there may be no equivalence carrying $e_0$ to $e_0'$ but that there 
is an equivalence carrying $e_0$ to some other point $e_1'$ of $(p')^{-1}(b_0)$. Can we determine 
whether this is the case merely by examining the subgroups $H_0$ and $H_0'$? We consider 
this problem now. 

If $H_1$ and $H_2$ are subgroups of a group G, you may recall from algebra that they 
are said to be conjugate subgroups if $H_2 = \alpha \cdot H_1 \cdot \alpha^{-1}$ for some element α of G. 
Said differently, they are conjugate if the isomorphism of G with itself that maps x to 
$\alpha \cdot x \cdot \alpha^{-1}$ carries the group $H_1$ onto the group $H_2$. It is easy to check that conjugacy 
is an equivalence relation on the collection of subgroups of G. The equivalence class 
of the subgroup H is called the conjugacy class of H. 


Lemma 79.3. Let $p : E \to B$ be a covering map. Let $e_0$ and $e_1$ be points of $p^{-1}(b_0)$, 
and let $H_i = p_*(\pi_1(E, e_i))$. 

(a) If γ is a path in E from $e_0$ to $e_1$, and α is the loop $p \circ \gamma$ in B, then the equation 
$[\alpha] * H_1 * [\alpha]^{-1} = H_0$ holds; hence $H_0$ and $H_1$ are conjugate. 

(b) Conversely, given $e_0$, and given a subgroup H of $\pi_1(B, b_0)$ conjugate to $H_0$, 
there exists a point $e_1$ of $p^{-1}(b_0)$ such that $H_1 = H$. 


Proof. (a) First, we show that $[\alpha] * H_1 * [\alpha]^{-1} \subset H_0$. Given an element $[h]$ of $H_1$, we 
have $[h] = p_*([\tilde h])$ for some loop $\tilde h$ in E based at $e_1$. Let $\tilde k$ be the path $\tilde k = (\gamma * \tilde h) * \bar\gamma$; 
it is a loop in E based at $e_0$, and 


$p_*([\tilde k]) = [(\alpha * h) * \bar\alpha] = [\alpha] * [h] * [\alpha]^{-1},$ 


so the latter element belongs to $p_*(\pi_1(E, e_0)) = H_0$, as desired. See Figure 79.2. 

Now we show that $[\alpha] * H_1 * [\alpha]^{-1} \supset H_0$. Note that $\bar\gamma$ is a path from $e_1$ to $e_0$ and 
$\bar\alpha$ equals the loop $p \circ \bar\gamma$. By the result just proved, we have 


$[\bar\alpha] * H_0 * [\bar\alpha]^{-1} \subset H_1,$ 


which implies our desired result. 

(b) To prove the converse, let $e_0$ be given and let H be conjugate to $H_0$. Then 
$H_0 = [\alpha] * H * [\alpha]^{-1}$ for some loop α in B based at $b_0$. Let γ be the lifting of α to a path 
in E beginning at $e_0$, and let $e_1 = \gamma(1)$. Then (a) implies that $H_0 = [\alpha] * H_1 * [\alpha]^{-1}$. 
We conclude that $H = H_1$. ■ 


Figure 79.2 


Theorem 79.4. Let $p : E \to B$ and $p' : E' \to B$ be covering maps; let $p(e_0) = 
p'(e_0') = b_0$. The covering maps p and p' are equivalent if and only if the subgroups 


$H_0 = p_*(\pi_1(E, e_0))$ and $H_0' = p'_*(\pi_1(E', e_0'))$ 


of $\pi_1(B, b_0)$ are conjugate. 


Proof. If $h : E \to E'$ is an equivalence, let $e_1' = h(e_0)$, and let $H_1' = p'_*(\pi_1(E', e_1'))$. 
Theorem 79.2 implies that $H_0 = H_1'$, while the preceding lemma tells us that $H_1'$ is 
conjugate to $H_0'$. 

Conversely, if the groups $H_0$ and $H_0'$ are conjugate, the preceding lemma implies 
there is a point $e_1'$ of E' such that $H_1' = H_0$. Theorem 79.2 then gives us an equivalence 
$h : E \to E'$ such that $h(e_0) = e_1'$. ■ 


EXAMPLE 1. Consider covering spaces of the circle $B = S^1$. Because $\pi_1(B, b_0)$ is 
abelian, two subgroups of $\pi_1(B, b_0)$ are conjugate if and only if they are equal. Therefore 
two coverings of B are equivalent if and only if they correspond to the same subgroup of 
$\pi_1(B, b_0)$. 

Now $\pi_1(B, b_0)$ is isomorphic to the integers $\mathbb{Z}$. What are the subgroups of $\mathbb{Z}$? It is a 
standard theorem of modern algebra that, given a nontrivial subgroup of $\mathbb{Z}$, it must be the 
group $G_n$ consisting of all multiples of n, for some $n \in \mathbb{Z}_+$. 

We have studied one covering space of the circle, the covering $p : \mathbb{R} \to S^1$. It 
must correspond to the trivial subgroup of $\pi_1(S^1, b_0)$, because $\mathbb{R}$ is simply connected. We 
have also considered the covering $p : S^1 \to S^1$ defined by $p(z) = z^n$, where z is a 
complex number. In this case, the map $p_*$ carries a generator of $\pi_1(S^1, b_0)$ into n times 
itself. Therefore, the group $p_*(\pi_1(S^1, b_0))$ corresponds to the subgroup $G_n$ of $\mathbb{Z}$ under the 
standard isomorphism of $\pi_1(S^1, b_0)$ with $\mathbb{Z}$. 

We conclude from the preceding theorem that every path-connected covering space 
of $S^1$ is equivalent to one of these coverings. 
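
The claim that $p_*$ multiplies a generator by n can be checked directly. The computation below is a sketch that assumes the base point $b_0 = 1$, the loop $\omega(s) = e^{2\pi i s}$ as the chosen generator, and the covering $t \mapsto e^{2\pi i t}$ of $S^1$ by $\mathbb{R}$ as the means of identifying $\pi_1(S^1, 1)$ with $\mathbb{Z}$.

```latex
\[
(p \circ \omega)(s) = \bigl(e^{2\pi i s}\bigr)^n = e^{2\pi i n s}, \qquad 0 \le s \le 1 .
\]
% The lift of p \circ \omega to \mathbb{R} beginning at 0 is s \mapsto ns, which ends at n.
\[
p_*([\omega]) = [p \circ \omega] \longmapsto n \in \mathbb{Z},
\qquad\text{so}\qquad
p_*(\pi_1(S^1,1)) \text{ corresponds to } G_n = n\mathbb{Z}.
\]
```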
Exercises 


1. Show that if $n > 1$, every continuous map $f : S^n \to S^1$ is nulhomotopic. [Hint: 
Use the lifting lemma.] 

2. (a) Show that every continuous map $f : P^2 \to S^1$ is nulhomotopic. 

(b) Find a continuous map of the torus into $S^1$ that is not nulhomotopic. 

3. Let $p : E \to B$ be a covering map; let $p(e_0) = b_0$. Show that $H_0 = 
p_*(\pi_1(E, e_0))$ is a normal subgroup of $\pi_1(B, b_0)$ if and only if for every pair 
of points $e_1$, $e_2$ of $p^{-1}(b_0)$, there is an equivalence $h : E \to E$ with $h(e_1) = e_2$. 

4. Let $T = S^1 \times S^1$, the torus. There is an isomorphism of $\pi_1(T, b_0 \times b_0)$ with 
$\mathbb{Z} \times \mathbb{Z}$ induced by projections of T onto its two factors. 

(a) Find a covering space of T corresponding to the subgroup of Z x Z generated 
by the element m x 0, where m is a positive integer. 

(b) Find a covering space of T corresponding to the trivial subgroup of Z x Z. 

(c) Find a covering space of T corresponding to the subgroup of Z x Z generated 
by m x 0 and 0 x n, where m and n are positive integers. 

*5. Let $T = S^1 \times S^1$ be the torus; let $x_0 = b_0 \times b_0$. 

(a) Prove the following: 

Theorem. Every isomorphism of $\pi_1(T, x_0)$ with itself is induced by a 
homeomorphism of T with itself that maps $x_0$ to $x_0$. 

[Hint: Let $p : \mathbb{R}^2 \to T$ be the usual covering map. If A is a 2 × 2 matrix 
with integer entries, the linear map $T_A : \mathbb{R}^2 \to \mathbb{R}^2$ with matrix A induces a 
continuous map $f : T \to T$. Furthermore, f is a homeomorphism if A is 
invertible over the integers.] 

(b) Prove the following: 

Theorem. If E is a covering space of T, then E is homeomorphic either 
to $\mathbb{R}^2$, or to $S^1 \times \mathbb{R}$, or to T. 
[Hint: You may use the following result from algebra: If F is a free abelian 
group of rank 2 and N is a nontrivial subgroup, then there is a basis $a_1$, $a_2$ 
for F such that either (1) $ma_1$ is a basis for N, for some positive integer m, 
or (2) $ma_1$, $na_2$ is a basis for N, where m and n are positive integers.] 

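*6. Prove the following: 

Theorem. Let G be a topological group with multiplication operation $m : G \times G \to G$ 
and identity element e. Assume $p : \tilde G \to G$ is a covering map. Given $\tilde e$ 
with $p(\tilde e) = e$, there is a unique multiplication operation on $\tilde G$ that makes it into 
a topological group such that $\tilde e$ is the identity element and p is a homomorphism. 

Proof. Recall that, by our convention, G and $\tilde G$ are path connected and locally 
path connected. 

(a) Let $I : G \to G$ be the map $I(g) = g^{-1}$. Show there exist unique maps 
$\tilde m : \tilde G \times \tilde G \to \tilde G$ and $\tilde I : \tilde G \to \tilde G$ with $\tilde m(\tilde e \times \tilde e) = \tilde e$ and $\tilde I(\tilde e) = \tilde e$ such that 
$p \circ \tilde m = m \circ (p \times p)$ and $p \circ \tilde I = I \circ p$. 

(b) Show the maps $\tilde G \to \tilde G$ given by $\tilde g \to \tilde m(\tilde e \times \tilde g)$ and $\tilde g \to \tilde m(\tilde g \times \tilde e)$ equal 
the identity map of $\tilde G$. [Hint: Use the uniqueness part of Lemma 79.1.] 
(c) Show the maps $\tilde G \to \tilde G$ given by $\tilde g \to \tilde m(\tilde g \times \tilde I(\tilde g))$ and $\tilde g \to \tilde m(\tilde I(\tilde g) \times \tilde g)$ 
map $\tilde G$ to $\tilde e$. 
(d) Show the maps $\tilde G \times \tilde G \times \tilde G \to \tilde G$ given by 


$\tilde g \times \tilde g' \times \tilde g'' \to \tilde m(\tilde g \times \tilde m(\tilde g' \times \tilde g''))$ 
$\tilde g \times \tilde g' \times \tilde g'' \to \tilde m(\tilde m(\tilde g \times \tilde g') \times \tilde g'')$ 


are equal. 
(e) Complete the proof. 


7. Let $p : \tilde G \to G$ be a homomorphism of topological groups that is a covering 
map. Show that if G is abelian, so is $\tilde G$. 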


§80 The Universal Covering Space 


Suppose $p : E \to B$ is a covering map, with $p(e_0) = b_0$. If E is simply connected, 
then E is called a universal covering space of B. Since $\pi_1(E, e_0)$ is trivial, this covering 
space corresponds to the trivial subgroup of $\pi_1(B, b_0)$ under the correspondence 
defined in the preceding section. Theorem 79.4 thus implies that any two universal 
covering spaces of B are equivalent. For this reason, we often speak of “the” universal 
covering space of a given space B. Not every space has a universal covering space, as 
we shall see. For the moment, we shall simply assume that B has a universal covering 
space and derive some consequences of this assumption. 
We prove two preliminary lemmas: 


Lemma 80.1. Let B be path connected and locally path connected. Let $p : E \to B$ 
be a covering map in the former sense (so that E is not required to be path connected). 
If $E_0$ is a path component of E, then the map $p_0 : E_0 \to B$ obtained by restricting p 
is a covering map. 


Proof. We first show $p_0$ is surjective. Since the space E is locally homeomorphic 
to B, it is locally path connected. Therefore $E_0$ is open in E. It follows that $p(E_0)$ is 
open in B. We show that $p(E_0)$ is also closed in B, so that $p(E_0) = B$. 

Let x be a point of B belonging to the closure of $p(E_0)$. Let U be a path-connected 
neighborhood of x that is evenly covered by p. Since U contains a point of $p(E_0)$, 
some slice $V_\alpha$ of $p^{-1}(U)$ must intersect $E_0$. Since $V_\alpha$ is homeomorphic to U, it is 
path connected; therefore it must be contained in $E_0$. Then $p(V_\alpha) = U$ is contained 
in $p(E_0)$, so that in particular, $x \in p(E_0)$. 

Now we show $p_0 : E_0 \to B$ is a covering map. Given $x \in B$, choose a neighborhood 
U of x as before. If $V_\alpha$ is a slice of $p^{-1}(U)$, then $V_\alpha$ is path connected; if it 
intersects $E_0$, it lies in $E_0$. Therefore, $p_0^{-1}(U)$ equals the union of those slices $V_\alpha$ of 
$p^{-1}(U)$ that intersect $E_0$; each of these is open in $E_0$ and is mapped homeomorphically 
by $p_0$ onto U. Thus U is evenly covered by $p_0$. ■ 
Lemma 80.2. Let p, q, and r be continuous maps with $p = r \circ q$, as in the following 
diagram: 

$X \xrightarrow{\;q\;} Y \xrightarrow{\;r\;} Z$, with composite $p : X \to Z$. 

(a) If p and r are covering maps, so is q. 
*(b) If p and q are covering maps, so is r. 


Proof. By our convention, X, Y, and Z are path connected and locally path connected. 
Let $x_0 \in X$; set $y_0 = q(x_0)$ and $z_0 = p(x_0)$. 

(a) Assume that p and r are covering maps. We show first that q is surjective. 
Given $y \in Y$, choose a path $\tilde\alpha$ in Y from $y_0$ to y. Then $\alpha = r \circ \tilde\alpha$ is a path in Z 
beginning at $z_0$; let $\tilde{\tilde\alpha}$ be a lifting of α to a path in X beginning at $x_0$. Then $q \circ \tilde{\tilde\alpha}$ is a 
lifting of α to Y that begins at $y_0$. By uniqueness of path liftings, $\tilde\alpha = q \circ \tilde{\tilde\alpha}$. Then q 
maps the end point of $\tilde{\tilde\alpha}$ to the end point y of $\tilde\alpha$. Thus q is surjective. 

Given $y \in Y$, we find a neighborhood of y that is evenly covered by q. Let $z = r(y)$. 
Since p and r are covering maps, we can find a path-connected neighborhood U 
of z that is evenly covered by both p and r. Let V be the slice of $r^{-1}(U)$ that contains 
the point y; we show V is evenly covered by q. Let $\{U_\alpha\}$ be the collection of slices 
of $p^{-1}(U)$. Now q maps each set $U_\alpha$ into the set $r^{-1}(U)$; because $U_\alpha$ is connected, 
it must be mapped by q into a single one of the slices of $r^{-1}(U)$. Therefore, $q^{-1}(V)$ 
equals the union of those slices $U_\alpha$ that are mapped by q into V. It is easy to see that 
each such $U_\alpha$ is mapped homeomorphically onto V by q. For let $p_0$, $q_0$, $r_0$ be the maps 
obtained by restricting p, q, and r, respectively, as indicated in the following diagram: 

$U_\alpha \xrightarrow{\;q_0\;} V \xrightarrow{\;r_0\;} U$, with composite $p_0 : U_\alpha \to U$. 

Because $p_0$ and $r_0$ are homeomorphisms, so is $q_0 = r_0^{-1} \circ p_0$. 

*(b) We shall use this result only in the exercises. Assume that p and q are covering 
maps. Because $p = r \circ q$ and p is surjective, r is also surjective. 

Given $z \in Z$, let U be a path-connected neighborhood of z that is evenly covered 
by p. We show that U is also evenly covered by r. Let $\{V_\beta\}$ be the collection of path 
components of $r^{-1}(U)$; these sets are disjoint and open in Y. We show that for each β, 
the map r carries $V_\beta$ homeomorphically onto U. 

Let $\{U_\alpha\}$ be the collection of slices of $p^{-1}(U)$; they are disjoint, open, and path 
connected, so they are the path components of $p^{-1}(U)$. Now q maps each $U_\alpha$ into the 
set $r^{-1}(U)$; because $U_\alpha$ is connected, it must be mapped by q into one of the sets $V_\beta$. 
Therefore $q^{-1}(V_\beta)$ equals the union of a subcollection of the collection $\{U_\alpha\}$. Theorem 53.2 
and Lemma 80.1 together imply that if $U_{\alpha_0}$ is any one of the path components 
of $q^{-1}(V_\beta)$, then the map $q_0 : U_{\alpha_0} \to V_\beta$ obtained by restricting q is a covering map. 
In particular, $q_0$ is surjective. Hence $q_0$ is a homeomorphism, being continuous, open, 
and injective as well. Consider the maps 

$U_{\alpha_0} \xrightarrow{\;q_0\;} V_\beta \xrightarrow{\;r_0\;} U$, with composite $p_0 : U_{\alpha_0} \to U$, 

obtained by restricting p, q, and r. Because $p_0$ and $q_0$ are homeomorphisms, so is $r_0$. ■ 


Theorem 80.3. Let $p : E \to B$ be a covering map, with E simply connected. Given 
any covering map $r : Y \to B$, there is a covering map $q : E \to Y$ such that $r \circ q = p$. 


This theorem shows why E is called a universal covering space of B; it covers 
every other covering space of B. 


Proof. Let $b_0 \in B$; choose $e_0$ and $y_0$ so that $p(e_0) = b_0$ and $r(y_0) = b_0$. We apply 
Lemma 79.1 to construct q. The map r is a covering map, and the condition 


$p_*(\pi_1(E, e_0)) \subset r_*(\pi_1(Y, y_0))$ 


is satisfied trivially because E is simply connected. Therefore, there is a map $q : E \to Y$ 
such that $r \circ q = p$ and $q(e_0) = y_0$. It follows from the preceding lemma that q is 
a covering map. ■ 


Now we give an example of a space that has no universal covering space. We need 
the following lemma. 


Lemma 80.4. Let $p : E \to B$ be a covering map; let $p(e_0) = b_0$. If $E$ is simply
connected, then $b_0$ has a neighborhood $U$ such that inclusion $i : U \to B$ induces the
trivial homomorphism

$$i_* : \pi_1(U, b_0) \to \pi_1(B, b_0).$$


Proof. Let $U$ be a neighborhood of $b_0$ that is evenly covered by $p$; break $p^{-1}(U)$ up
into slices; let $U_\alpha$ be the slice containing $e_0$. Let $f$ be a loop in $U$ based at $b_0$. Because
$p$ defines a homeomorphism of $U_\alpha$ with $U$, the loop $f$ lifts to a loop $\tilde f$ in $U_\alpha$ based
at $e_0$. Since $E$ is simply connected, there is a path homotopy $\tilde F$ in $E$ between $\tilde f$ and a
constant loop. Then $p \circ \tilde F$ is a path homotopy in $B$ between $f$ and a constant loop. ∎
EXAMPLE 1. Let $X$ be our familiar “infinite earring” in the plane; if $C_n$ is the circle
of radius $1/n$ in the plane with center at the point $(1/n, 0)$, then $X$ is the union of the
circles $C_n$. Let $b_0$ be the origin; we show that if $U$ is any neighborhood of $b_0$ in $X$, then
the homomorphism of fundamental groups induced by inclusion $i : U \to X$ is not trivial.

Given $n$, there is a retraction $r : X \to C_n$ obtained by letting $r$ map each circle $C_i$
for $i \neq n$ to the point $b_0$. Choose $n$ large enough that $C_n$ lies in $U$. Then in the following
diagram of homomorphisms induced by inclusion, $j_*$ is injective (because $r \circ j$ is the identity
map of $C_n$); hence $i_*$ cannot be trivial.

[Diagram: $j_* : \pi_1(C_n, b_0) \to \pi_1(X, b_0)$ factors as the homomorphism
$\pi_1(C_n, b_0) \to \pi_1(U, b_0)$ followed by $i_* : \pi_1(U, b_0) \to \pi_1(X, b_0)$.]

It follows that even though X is path connected and locally path connected, it has no 
universal covering space. 


Exercise 


1. Let $q : X \to Y$ and $r : Y \to Z$ be maps; let $p = r \circ q$.
(a) Let $q$ and $r$ be covering maps. Show that if $Z$ has a universal covering space,
then $p$ is a covering map. Compare Exercise 4 of §53.
*(b) Give an example where $q$ and $r$ are covering maps but $p$ is not.


*§81 Covering Transformations 


Given a covering map $p : E \to B$, it is of some interest to consider the set of all
equivalences of this covering space with itself. Such an equivalence is called a covering
transformation. Composites and inverses of covering transformations are covering
transformations, so this set forms a group; it is called the group of covering
transformations and denoted $C(E, p, B)$.

Throughout this section, we shall assume that $p : E \to B$ is a covering map
with $p(e_0) = b_0$; and we shall let $H_0 = p_*(\pi_1(E, e_0))$. We shall show that the
group $C(E, p, B)$ is completely determined by the group $\pi_1(B, b_0)$ and the subgroup
$H_0$. Specifically, we shall show that if $N(H_0)$ is the largest subgroup of $\pi_1(B, b_0)$ of
which $H_0$ is a normal subgroup, then $C(E, p, B)$ is isomorphic to $N(H_0)/H_0$.

We define $N(H_0)$ formally as follows:


Definition. If H is a subgroup of the group G, then the normalizer of H in G is the 
subset of G defined by the equation 
$$N(H) = \{ g \mid gHg^{-1} = H \}.$$


It is easy to see that N(H) is a subgroup of G. It follows from the definition that it 
contains H as a normal subgroup and is the largest such subgroup of G. 
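
The definition can be checked by brute force in a small finite group. The following is a minimal sketch, not part of the text's development: it assumes $G$ is the symmetric group on three letters, with permutations stored as tuples, and the helper names (`compose`, `inverse`, `normalizer`) are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p o q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def conjugate_set(g, H):
    """Return the set g H g^{-1}."""
    g_inv = inverse(g)
    return {compose(compose(g, h), g_inv) for h in H}

def normalizer(G, H):
    """N(H) = { g in G : g H g^{-1} = H }."""
    return {g for g in G if conjugate_set(g, H) == H}

G = set(permutations(range(3)))             # the symmetric group on {0, 1, 2}
identity = (0, 1, 2)
H_swap = {identity, (1, 0, 2)}              # generated by one transposition
H_rot = {identity, (1, 2, 0), (2, 0, 1)}    # the rotation subgroup

N_swap = normalizer(G, H_swap)
N_rot = normalizer(G, H_rot)
```

In this sketch the subgroup generated by a transposition is its own normalizer, while the rotation subgroup is normal in $G$, so its normalizer is all of $G$.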
The correspondence between the groups $N(H_0)/H_0$ and $C(E, p, B)$ is established
by using the lifting correspondence of §54 and the results about the existence of
equivalences proved in §79. We make the following definition:


Definition. Given $p : E \to B$ with $p(e_0) = b_0$, let $F$ be the set $F = p^{-1}(b_0)$. Let

$$\Phi : \pi_1(B, b_0)/H_0 \to F$$

be the lifting correspondence of Theorem 54.6; it is a bijection. Define also a
correspondence

$$\Psi : C(E, p, B) \to F$$

by setting $\Psi(h) = h(e_0)$ for each covering transformation $h : E \to E$. Since $h$ is
uniquely determined once its value at $e_0$ is known, the correspondence $\Psi$ is injective.


Lemma 81.1. The image of the map $\Psi$ equals the image under $\Phi$ of the subgroup
$N(H_0)/H_0$ of $\pi_1(B, b_0)/H_0$.


Proof. Recall that the lifting correspondence $\phi : \pi_1(B, b_0) \to F$ is defined as
follows: Given a loop $\alpha$ in $B$ at $b_0$, let $\gamma$ be its lift to $E$ beginning at $e_0$; let $e_1 = \gamma(1)$;
and define $\phi$ by setting $\phi([\alpha]) = e_1$. To prove the lemma, we need to show that there
is a covering transformation $h : E \to E$ with $h(e_0) = e_1$ if and only if $[\alpha] \in N(H_0)$.

This is easy. Lemma 79.1 tells us that $h$ exists if and only if $H_0 = H_1$, where
$H_1 = p_*(\pi_1(E, e_1))$. And Lemma 79.3 tells us that $[\alpha] * H_1 * [\alpha]^{-1} = H_0$. Hence $h$
exists if and only if $[\alpha] * H_0 * [\alpha]^{-1} = H_0$, which is simply the statement that
$[\alpha] \in N(H_0)$. ∎


Theorem 81.2. The bijection 
$$\Phi^{-1} \circ \Psi : C(E, p, B) \to N(H_0)/H_0$$


is an isomorphism of groups. 


Proof. We need only show that $\Phi^{-1} \circ \Psi$ is a homomorphism. Let $h, k : E \to E$ be
covering transformations. Let $h(e_0) = e_1$ and $k(e_0) = e_2$; then

$$\Psi(h) = e_1 \quad\text{and}\quad \Psi(k) = e_2,$$

by definition. Choose paths $\gamma$ and $\delta$ in $E$ from $e_0$ to $e_1$ and $e_2$, respectively. If $\alpha = p \circ \gamma$
and $\beta = p \circ \delta$, then

$$\Phi([\alpha]H_0) = e_1 \quad\text{and}\quad \Phi([\beta]H_0) = e_2,$$

by definition. Let $e_3 = h(k(e_0))$; then $\Psi(h \circ k) = e_3$. We show that

$$\Phi([\alpha * \beta]H_0) = e_3,$$

and the proof is complete.

Since $\delta$ is a path from $e_0$ to $e_2$, the path $h \circ \delta$ is a path from $h(e_0) = e_1$ to
$h(e_2) = h(k(e_0)) = e_3$. See Figure 81.1. Then the product $\gamma * (h \circ \delta)$ is defined and is
a path from $e_0$ to $e_3$. It is a lifting of $\alpha * \beta$, since $p \circ \gamma = \alpha$ and
$p \circ h \circ \delta = p \circ \delta = \beta$.
Therefore $\Phi([\alpha * \beta]H_0) = e_3$, as desired. ∎



Figure 81.1 


Corollary 81.3. The group $H_0$ is a normal subgroup of $\pi_1(B, b_0)$ if and only if for
every pair of points $e_1$ and $e_2$ of $p^{-1}(b_0)$, there is a covering transformation
$h : E \to E$ with $h(e_1) = e_2$. In this case, there is an isomorphism

$$\Phi^{-1} \circ \Psi : C(E, p, B) \to \pi_1(B, b_0)/H_0.$$


Corollary 81.4. Let $p : E \to B$ be a covering map. If $E$ is simply connected, then
$C(E, p, B) \cong \pi_1(B, b_0)$.
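
As a worked instance of Theorem 81.2 (an illustration added here, not taken from the text), consider the covering map $p : S^1 \to S^1$ given by $p(z) = z^n$, where $S^1$ is viewed as the unit circle in the complex plane and $e_0 = b_0 = 1$. Under the usual identification of $\pi_1(S^1, b_0)$ with $\mathbb{Z}$, the homomorphism $p_*$ is multiplication by $n$, so the computation runs as follows:

```latex
\begin{align*}
\pi_1(S^1, b_0) &\cong \mathbb{Z}, \\
H_0 = p_*(\pi_1(S^1, e_0)) &= n\mathbb{Z}, \\
N(H_0) &= \mathbb{Z}
  \quad \text{(every subgroup of an abelian group is normal)}, \\
C(S^1, p, S^1) &\cong N(H_0)/H_0 = \mathbb{Z}/n\mathbb{Z}.
\end{align*}
```

Concretely, the covering transformations in this case are the $n$ rotations $z \mapsto e^{2\pi i m/n} z$ for $m = 0, \ldots, n-1$.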


If $H_0$ is a normal subgroup of $\pi_1(B, b_0)$, then $p : E \to B$ is called a regular
covering map. (Here is another example of the overuse of familiar terms. The words
“normal” and “regular” have already been used to mean quite different things!)


EXAMPLE 1. Because the fundamental group of the circle is abelian, every covering
of $S^1$ is regular. If $p : \mathbb{R} \to S^1$ is the standard covering map, for instance, the covering
transformations are the homeomorphisms $x \mapsto x + n$. The group of such transformations
is isomorphic to $\mathbb{Z}$.
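
The claim in Example 1 can be checked numerically at sample points. The sketch below assumes the standard covering map is $p(x) = (\cos 2\pi x, \sin 2\pi x)$; the helper names and the choice of sample points are illustrative only, and a finite check of this kind is evidence, not a proof.

```python
import math

def p(x):
    """Standard covering map R -> S^1, with S^1 the unit circle in the plane."""
    return (math.cos(2 * math.pi * x), math.sin(2 * math.pi * x))

def translation(n):
    """The homeomorphism x -> x + n of the real line."""
    return lambda x: x + n

def close(u, v, tol=1e-9):
    """Compare two points of the plane up to floating-point error."""
    return all(abs(a - b) <= tol for a, b in zip(u, v))

samples = [-1.3, 0.0, 0.25, 2.7]

# Each translation h satisfies p(h(x)) = p(x), as a covering transformation must.
commutes_with_p = all(
    close(p(translation(n)(x)), p(x)) for n in range(-3, 4) for x in samples
)

# Composition of translations corresponds to addition of integers.
h, k = translation(2), translation(-5)
composition_is_addition = all(
    math.isclose(h(k(x)), translation(2 + (-5))(x), abs_tol=1e-12) for x in samples
)
```

The second check reflects the isomorphism with $\mathbb{Z}$: composing the translations by $2$ and by $-5$ gives the translation by $-3$.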


EXAMPLE 2. For an example at the other extreme, consider the covering space of the
figure eight indicated in Figure 81.2. (We considered this covering earlier, in §60. The
x-axis is wrapped around the circle $A$ and the y-axis is wrapped around $B$. The circles $A_i$
and $B_i$ are mapped homeomorphically onto $A$ and $B$, respectively.) In this case, we show
that the group $C(E, p, B)$ is trivial.
In general, if $h : E \to E$ is a covering transformation, then any loop in the base space
that lifts to a loop in $E$ at $e_0$ also lifts to a loop when the lift begins at $h(e_0)$. In the present
case, a loop that generates the fundamental group of $A$ lifts to a non-loop when the lift
is based at $e_0$ and lifts to a loop when it is based at any other point of $p^{-1}(b_0)$ lying on
the y-axis. Similarly, a loop that generates the fundamental group of $B$ lifts to a non-loop
beginning at $e_0$ and to a loop beginning at any other point of $p^{-1}(b_0)$ lying on the x-axis.
It follows that $h(e_0) = e_0$, so that $h$ is the identity map.


Figure 81.2 


There is a method for constructing covering spaces that automatically leads to a 
covering that is regular; and in fact every regular covering space can be constructed by 
this method. It involves the action of a group G on a space X. 


Definition. Let $X$ be a space, and let $G$ be a subgroup of the group of homeomorphisms
of $X$ with itself. The orbit space $X/G$ is defined to be the quotient space
obtained from $X$ by means of the equivalence relation $x \sim g(x)$ for all $x \in X$ and all
$g \in G$. The equivalence class of $x$ is called the orbit of $x$.


Definition. If $G$ is a group of homeomorphisms of $X$, the action of $G$ on $X$ is said
to be properly discontinuous if for every $x \in X$ there is a neighborhood $U$ of $x$ such
that $g(U)$ is disjoint from $U$ whenever $g \neq e$. (Here $e$ is the identity element of $G$.)
It follows that $g_0(U)$ and $g_1(U)$ are disjoint whenever $g_0 \neq g_1$, for otherwise $U$ and
$g_0^{-1} g_1(U)$ would not be disjoint.
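
For a concrete instance of this definition, consider the group of integer translations of the real line (the group of Example 1). The sketch below is an assumption-laden illustration: it models a neighborhood of $x$ as an open interval, checks only a finite range of shifts, and uses hypothetical helper names.

```python
def interval(x, radius):
    """Open interval (x - radius, x + radius), stored as a pair of endpoints."""
    return (x - radius, x + radius)

def translate(iv, n):
    """Image of an interval under the translation t -> t + n."""
    a, b = iv
    return (a + n, b + n)

def disjoint(iv1, iv2):
    """Open intervals (a, b) and (c, d) are disjoint iff b <= c or d <= a."""
    a, b = iv1
    c, d = iv2
    return b <= c or d <= a

def witnesses_proper_discontinuity(x, radius, shifts):
    """Check that U is disjoint from U + n for every nonzero n in `shifts`."""
    U = interval(x, radius)
    return all(disjoint(U, translate(U, n)) for n in shifts if n != 0)

shifts = range(-5, 6)
small_neighborhood_works = witnesses_proper_discontinuity(0.4, 0.25, shifts)
large_neighborhood_fails = not witnesses_proper_discontinuity(0.4, 0.6, shifts)
```

An interval of length $1/2$ is disjoint from all of its nonzero integer translates, whereas an interval of length greater than $1$ meets its translate by $1$; the definition only requires that some neighborhood of each point work.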


Theorem 81.5. Let $X$ be path connected and locally path connected; let $G$ be a group
of homeomorphisms of $X$. The quotient map $\pi : X \to X/G$ is a covering map if and
only if the action of $G$ is properly discontinuous. In this case, the covering map $\pi$ is
regular and $G$ is its group of covering transformations.
Proof. We show $\pi$ is an open map. If $U$ is open in $X$, then $\pi^{-1}\pi(U)$ is the union of
the open sets $g(U)$ of $X$, for $g \in G$. Hence $\pi^{-1}\pi(U)$ is open in $X$, so that $\pi(U)$ is
open in $X/G$ by definition. Thus $\pi$ is open.

Step 1. We suppose that the action of $G$ is properly discontinuous and show that $\pi$
is a covering map. Given $x \in X$, let $U$ be a neighborhood of $x$ such that $g_0(U)$ and
$g_1(U)$ are disjoint whenever $g_0 \neq g_1$. Then $\pi(U)$ is evenly covered by $\pi$. Indeed,
$\pi^{-1}\pi(U)$ equals the union of the disjoint open sets $g(U)$, for $g \in G$, each of which
contains at most one point of each orbit. Therefore, the map $g(U) \to \pi(U)$ obtained
by restricting $\pi$ is bijective; being continuous and open, it is a homeomorphism. The
sets $g(U)$, for $g \in G$, thus form a partition of $\pi^{-1}\pi(U)$ into slices.

Step 2. We suppose now that $\pi$ is a covering map and show that the action of $G$ is
properly discontinuous. Given $x \in X$, let $V$ be a neighborhood of $\pi(x)$ that is evenly
covered by $\pi$. Partition $\pi^{-1}(V)$ into slices; let $U_\alpha$ be the slice containing $x$. Given
$g \in G$ with $g \neq e$, the set $g(U_\alpha)$ must be disjoint from $U_\alpha$, for otherwise, two points
of $U_\alpha$ would belong to the same orbit and the restriction of $\pi$ to $U_\alpha$ would not be
injective. It follows that the action of $G$ is properly discontinuous.

Step 3. We show that if $\pi$ is a covering map, then $G$ is its group of covering
transformations and $\pi$ is regular. Certainly any $g \in G$ is a covering transformation,
for $\pi \circ g = \pi$ because the orbit of $g(x)$ equals the orbit of $x$. On the other hand, let $h$
be a covering transformation with $h(x_1) = x_2$, say. Because $\pi \circ h = \pi$, the points $x_1$
and $x_2$ map to the same point under $\pi$; therefore there is an element $g \in G$ such that
$g(x_1) = x_2$. The uniqueness part of Theorem 79.2 then implies that $h = g$.

It follows that $\pi$ is regular. Indeed, for any two points $x_1$ and $x_2$ lying in the same
orbit, there is an element $g \in G$ such that $g(x_1) = x_2$. Then Corollary 81.3 applies. ∎


Theorem 81.6. If $p : X \to B$ is a regular covering map and $G$ is its group of
covering transformations, then there is a homeomorphism $k : X/G \to B$ such that
$p = k \circ \pi$, where $\pi : X \to X/G$ is the projection.

[Diagram: commutative square with the identity $X = X$ across the top, $\pi : X \to X/G$ and
$p : X \to B$ as the vertical maps, and $k : X/G \to B$ across the bottom.]


Proof. If $g$ is a covering transformation, then $p(g(x)) = p(x)$ by definition. Hence
$p$ is constant on each orbit, so it induces a continuous map $k$ of the quotient space $X/G$
into $B$. On the other hand, $p$ is a quotient map because it is continuous, surjective, and
open. Because $p$ is regular, any two points of $p^{-1}(b)$ belong to the same orbit under
the action of $G$. Therefore, $\pi$ induces a continuous map $B \to X/G$ that is an inverse
for $k$. ∎


EXAMPLE 3. Let $X$ be the cylinder $S^1 \times I$; let $h : X \to X$ be the homeomorphism
$h(x, t) = (-x, t)$, and let $k : X \to X$ be the homeomorphism $k(x, t) = (-x, 1 - t)$.
The groups $G_1 = \{e, h\}$ and $G_2 = \{e, k\}$ are isomorphic to the integers modulo 2; both
act properly discontinuously on $X$. But $X/G_1$ is homeomorphic to $X$, while $X/G_2$ is
homeomorphic to the Möbius band, as you can check. See Figure 81.3.
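
Two properties used implicitly in Example 3 are that $h$ and $k$ each have order 2 and that neither has a fixed point. The following sketch checks both at sample points only; it assumes $S^1$ is modeled as the unit circle in the complex plane, and the sample points are arbitrary choices.

```python
import cmath

def h(point):
    """h(x, t) = (-x, t) on the cylinder S^1 x [0, 1]."""
    z, t = point
    return (-z, t)

def k(point):
    """k(x, t) = (-x, 1 - t) on the cylinder S^1 x [0, 1]."""
    z, t = point
    return (-z, 1 - t)

# Sample points (z, t) with |z| = 1 and 0 <= t <= 1.
samples = [
    (cmath.exp(1j * theta), t)
    for theta in (0.0, 1.0, 2.5, 4.0)
    for t in (0.0, 0.25, 0.5, 1.0)
]

h_is_involution = all(h(h(pt)) == pt for pt in samples)
k_is_involution = all(k(k(pt)) == pt for pt in samples)
h_moves_every_sample = all(h(pt) != pt for pt in samples)
k_moves_every_sample = all(k(pt) != pt for pt in samples)
```

Both maps send $z$ to $-z$ in the circle coordinate, which is why no point is fixed; the two maps differ only in whether the interval coordinate is also reversed.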




Figure 81.3 


Exercises 


1. (a) Find a group G of homeomorphisms of the torus T having order 2 such that 
T/G is homeomorphic to the torus. 
(b) Find a group $G$ of homeomorphisms of $T$ having order 2 such that $T/G$ is
homeomorphic to the Klein bottle.


2. Let $X = A \vee B$ be the wedge of two circles.
(a) Let $E$ be the space pictured in Figure 81.4; let $p : E \to X$ wrap each of the arcs $A_1$
and $A_2$ around $A$ and map $B_1$ and $B_2$ homeomorphically onto $B$. Show $p$ is
a regular covering map.
(b) Determine the group of covering transformations of the covering of $X$
indicated in Figure 81.5. Is this covering regular?


Figure 81.5 
(c) Repeat (b) for the covering pictured in Figure 81.6.
(d) Repeat (b) for the covering pictured in Figure 81.7.




Figure 81.7 


3. Let p : X — B be a covering map (not necessarily regular); let G be its group 
of covering transformations. 
(a) Show that the action of G on X is properly discontinuous. 
(b) Let $\pi : X \to X/G$ be the projection map. Show there is a covering map
$k : X/G \to B$ such that $k \circ \pi = p$.


4. Let G be a group of homeomorphisms of X. The action of G on X is said to 
be fixed-point free if no element of G other than the identity e has a fixed point. 
Show that if X is Hausdorff, and if G is a finite group of homeomorphisms of X 
whose action is fixed-point free, then the action of G is properly discontinuous. 


5. Consider $S^3$ as the space of all pairs of complex numbers $(z_1, z_2)$ satisfying the
equation $|z_1|^2 + |z_2|^2 = 1$. Given relatively prime positive integers $n$ and $k$,
define $h : S^3 \to S^3$ by the equation

$$h(z_1, z_2) = (z_1 e^{2\pi i/n}, z_2 e^{2\pi i k/n}).$$

(a) Show that $h$ generates a subgroup $G$ of the homeomorphism group of $S^3$
that is cyclic of order $n$, and that only the identity element of $G$ has a fixed
point. The orbit space $S^3/G$ is called the lens space $L(n, k)$.

(b) Show that if $L(n, k)$ and $L(n', k')$ are homeomorphic, then $n = n'$. [It is a
theorem that $L(n, k)$ and $L(n', k')$ are homeomorphic if and only if $n = n'$
and either $k \equiv k' \pmod{n}$ or $kk' \equiv 1 \pmod{n}$. The proof is decidedly
nontrivial.]

(c) Show that $L(n, k)$ is a compact 3-manifold.

6. Prove the following:

Theorem. Let $X$ be a locally compact Hausdorff space; let $G$ be a group of
homeomorphisms of $X$ such that the action of $G$ is fixed-point free. Suppose
that for each compact subspace $C$ of $X$, there are only finitely many elements $g$
of $G$ such that the intersection $C \cap g(C)$ is nonempty. Then the action of $G$ is
properly discontinuous, and $X/G$ is locally compact Hausdorff.

Proof.

(a) For each compact subspace $C$ of $X$, show that the union of the sets $g(C)$, for
$g \in G$, is closed in $X$. [Hint: If $U$ is a neighborhood of $x$ with $\bar U$ compact,
then $\bar U \cup C$ intersects $g(\bar U \cup C)$ for only finitely many $g$.]

(b) Show $X/G$ is Hausdorff.

(c) Show the action of $G$ is properly discontinuous.

(d) Show $X/G$ is locally compact.


§82 Existence of Covering Spaces 


We have shown that corresponding to each covering map $p : E \to B$ is a conjugacy
class of subgroups of $\pi_1(B, b_0)$, and that two such covering maps are equivalent if and
only if they correspond to the same such class. Thus, we have an injective correspondence
from equivalence classes of coverings of $B$ to conjugacy classes of subgroups of
$\pi_1(B, b_0)$. Now we ask the question whether this correspondence is surjective, that is,
whether for every conjugacy class of subgroups of $\pi_1(B, b_0)$, there exists a covering
of $B$ that corresponds to this class.

The answer to this question is “no,” in general. In §80, we gave an example of a 
path-connected, locally path-connected space B that had no simply connected cover- 
ing space, that is, that had no covering space corresponding to the class of the trivial 
subgroup. This example relied on Lemma 80.4, which gave a condition that any space 
having a simply connected covering space must satisfy. We now introduce this condi- 
tion formally. 


Definition. A space $B$ is said to be semilocally simply connected if for each $b \in B$,
there is a neighborhood $U$ of $b$ such that the homomorphism

$$i_* : \pi_1(U, b) \to \pi_1(B, b)$$


induced by inclusion is trivial. 
Note that if $U$ satisfies this condition, then so does any smaller neighborhood of $b$,
so that $b$ has “arbitrarily small” neighborhoods satisfying this condition. Note also that
this condition is weaker than true local simple connectedness, which would require that
within each neighborhood of $b$ there should exist a neighborhood $U$ of $b$ that is itself
simply connected.

Semilocal simple connectedness of $B$ is both necessary and sufficient for there to
exist, for every conjugacy class of subgroups of $\pi_1(B, b_0)$, a corresponding covering
space of $B$. Necessity was proved in Lemma 80.4; sufficiency is proved in this section.


Theorem 82.1. Let $B$ be path connected, locally path connected, and semilocally
simply connected. Let $b_0 \in B$. Given a subgroup $H$ of $\pi_1(B, b_0)$, there exists a
covering map $p : E \to B$ and a point $e_0 \in p^{-1}(b_0)$ such that

$$p_*(\pi_1(E, e_0)) = H.$$


Proof. Step 1. Construction of $E$. The procedure for constructing $E$ is reminiscent
of the procedure used in complex analysis for constructing Riemann surfaces. Let $P$
denote the set of all paths in $B$ beginning at $b_0$. Define an equivalence relation on $P$
by setting $\alpha \sim \beta$ if $\alpha$ and $\beta$ end at the same point of $B$ and

$$[\alpha * \bar\beta] \in H.$$

This relation is easily seen to be an equivalence relation. We will denote the
equivalence class of the path $\alpha$ by $\alpha^{\#}$.
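
The verification that this relation is an equivalence relation is left to the reader in the text; one way to carry it out is sketched below. It uses only the subgroup properties of $H$ and the groupoid properties of the operation $*$, with $\gamma$ denoting a third path in $P$ (a symbol introduced here for the transitivity step).

```latex
\begin{align*}
\text{Reflexivity:} \quad
  & [\alpha * \bar\alpha] \text{ is the identity element, which belongs to } H. \\
\text{Symmetry:} \quad
  & [\beta * \bar\alpha] = [\alpha * \bar\beta]^{-1} \in H
    \quad \text{whenever } [\alpha * \bar\beta] \in H. \\
\text{Transitivity:} \quad
  & [\alpha * \bar\gamma] = [\alpha * \bar\beta] * [\beta * \bar\gamma] \in H
    \quad \text{whenever } [\alpha * \bar\beta],\ [\beta * \bar\gamma] \in H.
\end{align*}
```

In each case the paths involved end at the same point of $B$, so the products are defined and are loops at $b_0$.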

Let $E$ denote the collection of equivalence classes, and define $p : E \to B$ by the
equation

$$p(\alpha^{\#}) = \alpha(1).$$

Since $B$ is path connected, $p$ is surjective. We shall topologize $E$ so that $p$ is a covering
map.
We first note two facts:

(1) If $[\alpha] = [\beta]$, then $\alpha^{\#} = \beta^{\#}$.

(2) If $\alpha^{\#} = \beta^{\#}$, then $(\alpha * \delta)^{\#} = (\beta * \delta)^{\#}$ for any path $\delta$ in $B$ beginning at $\alpha(1)$.

The first follows by noting that if $[\alpha] = [\beta]$, then $[\alpha * \bar\beta]$ is the identity element, which
belongs to $H$. The second follows by noting that $\alpha * \delta$ and $\beta * \delta$ end at the same point
of $B$, and

$$[(\alpha * \delta) * \overline{(\beta * \delta)}] = [(\alpha * \delta) * (\bar\delta * \bar\beta)] = [\alpha * \bar\beta],$$

which belongs to $H$ by hypothesis.


Step 2. Topologizing E. One way to topologize E is to give P the compact-open 
topology (see Chapter 7) and E the corresponding quotient topology. But we can 
topologize E directly as follows: 
Let $\alpha$ be any element of $P$, and let $U$ be any path-connected neighborhood of
$\alpha(1)$. Define

$$B(U, \alpha) = \{ (\alpha * \delta)^{\#} \mid \delta \text{ is a path in } U \text{ beginning at } \alpha(1) \}.$$

Note that $\alpha^{\#}$ is an element of $B(U, \alpha)$, since if $b = \alpha(1)$, then $\alpha^{\#} = (\alpha * e_b)^{\#}$; this
element belongs to $B(U, \alpha)$ by definition. We assert that the sets $B(U, \alpha)$ form a basis
for a topology on $E$.

First, we show that if $\beta^{\#} \in B(U, \alpha)$, then $\alpha^{\#} \in B(U, \beta)$ and $B(U, \alpha) = B(U, \beta)$.
If $\beta^{\#} \in B(U, \alpha)$, then $\beta^{\#} = (\alpha * \delta)^{\#}$ for some path $\delta$ in $U$. Then

$$(\beta * \bar\delta)^{\#} = ((\alpha * \delta) * \bar\delta)^{\#} \quad\text{by (2)}$$
$$= \alpha^{\#} \quad\text{by (1)},$$

so that $\alpha^{\#} \in B(U, \beta)$ by definition. See Figure 82.1. We show first that
$B(U, \beta) \subset B(U, \alpha)$. Note that the general element of $B(U, \beta)$ is of the form $(\beta * \gamma)^{\#}$, where $\gamma$ is
a path in $U$. Then note that

$$(\beta * \gamma)^{\#} = ((\alpha * \delta) * \gamma)^{\#} = (\alpha * (\delta * \gamma))^{\#},$$

which belongs to $B(U, \alpha)$ by definition. Symmetry gives the inclusion
$B(U, \alpha) \subset B(U, \beta)$ as well.


Figure 82.1 


Now we show the sets $B(U, \alpha)$ form a basis. If $\beta^{\#}$ belongs to the intersection
$B(U_1, \alpha_1) \cap B(U_2, \alpha_2)$, we need merely choose a path-connected neighborhood $V$
of $\beta(1)$ contained in $U_1 \cap U_2$. The inclusion

$$B(V, \beta) \subset B(U_1, \beta) \cap B(U_2, \beta)$$

follows from the definition of these sets, and the right side of the equation equals
$B(U_1, \alpha_1) \cap B(U_2, \alpha_2)$ by the result just proved.

Step 3. The map $p$ is continuous and open. It is easy to see that $p$ is open, for
the image of the basis element $B(U, \alpha)$ is the open subset $U$ of $B$: Given $x \in U$, we
choose a path $\delta$ in $U$ from $\alpha(1)$ to $x$; then $(\alpha * \delta)^{\#}$ is in $B(U, \alpha)$ and $p((\alpha * \delta)^{\#}) = x$.

To show that $p$ is continuous, let us take an element $\alpha^{\#}$ of $E$ and a neighborhood $W$
of $p(\alpha^{\#})$. Choose a path-connected neighborhood $U$ of the point $p(\alpha^{\#}) = \alpha(1)$ lying
in $W$. Then $B(U, \alpha)$ is a neighborhood of $\alpha^{\#}$ that $p$ maps into $W$. Thus $p$ is continuous
at $\alpha^{\#}$.

Step 4. Every point of $B$ has a neighborhood that is evenly covered by $p$. Given
$b_1 \in B$, choose $U$ to be a path-connected neighborhood of $b_1$ that satisfies the further
condition that the homomorphism $\pi_1(U, b_1) \to \pi_1(B, b_1)$ induced by inclusion is
trivial. We assert that $U$ is evenly covered by $p$.

First, we show that $p^{-1}(U)$ equals the union of the sets $B(U, \alpha)$, as $\alpha$ ranges
over all paths in $B$ from $b_0$ to $b_1$. Since $p$ maps each set $B(U, \alpha)$ onto $U$, it is clear
that $p^{-1}(U)$ contains this union. On the other hand, if $\beta^{\#}$ belongs to $p^{-1}(U)$, then
$\beta(1) \in U$. Choose a path $\delta$ in $U$ from $b_1$ to $\beta(1)$ and let $\alpha$ be the path $\beta * \bar\delta$ from $b_0$
to $b_1$. Then $[\beta] = [\alpha * \delta]$, so that $\beta^{\#} = (\alpha * \delta)^{\#}$, which belongs to $B(U, \alpha)$. Thus
$p^{-1}(U)$ is contained in the union of the sets $B(U, \alpha)$.

Second, note that distinct sets $B(U, \alpha)$ are disjoint. For if $\beta^{\#}$ belongs to
$B(U, \alpha_1) \cap B(U, \alpha_2)$, then $B(U, \alpha_1) = B(U, \beta) = B(U, \alpha_2)$, by Step 2.

Third, we show that $p$ defines a bijective map of $B(U, \alpha)$ with $U$. It follows that
$p \mid B(U, \alpha)$ is a homeomorphism, being bijective and continuous and open. We already
know that $p$ maps $B(U, \alpha)$ onto $U$. To prove injectivity, suppose that

$$p((\alpha * \delta_1)^{\#}) = p((\alpha * \delta_2)^{\#}),$$

where $\delta_1$ and $\delta_2$ are paths in $U$. Then $\delta_1(1) = \delta_2(1)$. Because the homomorphism
$\pi_1(U, b_1) \to \pi_1(B, b_1)$ induced by inclusion is trivial, $\delta_1 * \bar\delta_2$ is path homotopic in $B$
to the constant loop. Then $[\alpha * \delta_1] = [\alpha * \delta_2]$, so that $(\alpha * \delta_1)^{\#} = (\alpha * \delta_2)^{\#}$, as desired.

It follows that p : E — B is a covering map in the sense used in earlier chapters. 
To show it is a covering map in the sense used in this chapter, we must show E is path 
connected. This we shall do shortly. 

Step 5. Lifting a path in $B$. Let $e_0$ denote the equivalence class of the constant
path at $b_0$; then $p(e_0) = b_0$ by definition. Given a path $\alpha$ in $B$ beginning at $b_0$, we
calculate its lift to a path in $E$ beginning at $e_0$ and show that this lift ends at $\alpha^{\#}$.

To begin, given $c \in [0, 1]$, let $\alpha_c : I \to B$ denote the path defined by the equation

$$\alpha_c(t) = \alpha(tc) \quad\text{for } 0 \le t \le 1.$$

Then $\alpha_c$ is the “portion” of $\alpha$ that runs from $\alpha(0)$ to $\alpha(c)$. In particular, $\alpha_0$ is the
constant path at $b_0$, and $\alpha_1$ is the path $\alpha$ itself. We define $\tilde\alpha : I \to E$ by the equation

$$\tilde\alpha(c) = (\alpha_c)^{\#}$$

and show that $\tilde\alpha$ is continuous. Then $\tilde\alpha$ is a lift of $\alpha$, since $p(\tilde\alpha(c)) = \alpha_c(1) = \alpha(c)$;
furthermore, $\tilde\alpha$ begins at $(\alpha_0)^{\#} = e_0$ and ends at $(\alpha_1)^{\#} = \alpha^{\#}$.

To verify continuity, we introduce the following notation. Given $0 \le c < d \le 1$,
let $\delta_{c,d}$ denote the path that equals the positive linear map of $I$ onto $[c, d]$ followed
by $\alpha$. Note that the paths $\alpha_d$ and $\alpha_c * \delta_{c,d}$ are path homotopic because one is just a
reparametrization of the other. See Figure 82.2.
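
The reparametrization claim can be made explicit. The sketch below is illustrative only: it assumes a particular sample path in the plane, uses the usual convention that a product $f * g$ runs $f$ on $[0, 1/2]$ and $g$ on $[1/2, 1]$, and introduces a hypothetical piecewise-linear map `reparam` for which $(\alpha_c * \delta_{c,d})(t) = \alpha_d(\varphi(t))$.

```python
import math

def alpha(t):
    """A sample path in the plane (an assumption made for illustration)."""
    return (math.cos(2 * math.pi * t), t)

def initial_portion(path, c):
    """alpha_c(t) = alpha(t c): the portion of the path from alpha(0) to alpha(c)."""
    return lambda t: path(t * c)

def middle_portion(path, c, d):
    """delta_{c,d}: the positive linear map of [0, 1] onto [c, d], followed by the path."""
    return lambda t: path(c + t * (d - c))

def product(f, g):
    """Path product f * g: run f on [0, 1/2], then g on [1/2, 1]."""
    return lambda t: f(2 * t) if t <= 0.5 else g(2 * t - 1)

def reparam(c, d):
    """Piecewise-linear phi with (alpha_c * delta_{c,d})(t) = alpha_d(phi(t)); needs d > 0."""
    return lambda t: (2 * t * c) / d if t <= 0.5 else (c + (2 * t - 1) * (d - c)) / d

def close(u, v, tol=1e-9):
    """Compare two points of the plane up to floating-point error."""
    return all(abs(a - b) <= tol for a, b in zip(u, v))

c, d = 0.3, 0.8
alpha_c = initial_portion(alpha, c)
alpha_d = initial_portion(alpha, d)
delta = middle_portion(alpha, c, d)
lhs = product(alpha_c, delta)
phi = reparam(c, d)

samples = [i / 20 for i in range(21)]
same_trace = all(close(lhs(t), alpha_d(phi(t))) for t in samples)
```

Since $\varphi$ is a continuous map of $[0, 1]$ to itself fixing the end points, the straight-line homotopy from $\varphi$ to the identity gives the path homotopy between the two paths.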


Figure 82.2 


We now verify continuity of $\tilde\alpha$ at the point $c$ of $[0, 1]$. Let $W$ be a basis element
in $E$ about the point $\tilde\alpha(c)$. Then $W$ equals $B(U, \alpha_c)$ for some path-connected
neighborhood $U$ of $\alpha(c)$. Choose $\epsilon > 0$ so that for $|c - t| < \epsilon$, the point $\alpha(t)$ lies in $U$.
We show that if $d$ is a point of $[0, 1]$ with $|c - d| < \epsilon$, then $\tilde\alpha(d) \in W$; this proves
continuity of $\tilde\alpha$ at $c$.

So suppose $|c - d| < \epsilon$. Take first the case where $d > c$. Set $\delta = \delta_{c,d}$; then since
$[\alpha_d] = [\alpha_c * \delta]$, we have

$$\tilde\alpha(d) = (\alpha_d)^{\#} = (\alpha_c * \delta)^{\#}.$$

Since $\delta$ lies in $U$, we have $\tilde\alpha(d) \in B(U, \alpha_c)$, as desired. If $d < c$, set $\delta = \delta_{d,c}$ and
proceed similarly.


Step 6. The map $p : E \to B$ is a covering map. We need only verify that $E$ is
path connected, and this is easy. For if $\alpha^{\#}$ is any point of $E$, then the lift $\tilde\alpha$ of the path
$\alpha$ is a path in $E$ from $e_0$ to $\alpha^{\#}$.


Step 7. Finally, $H = p_*(\pi_1(E, e_0))$. Let $\alpha$ be a loop in $B$ at $b_0$. Let $\tilde\alpha$ be its lift
to $E$ beginning at $e_0$. Theorem 54.6 tells us that $[\alpha] \in p_*(\pi_1(E, e_0))$ if and only if $\tilde\alpha$
is a loop in $E$. Now the final point of $\tilde\alpha$ is the point $\alpha^{\#}$, and $\alpha^{\#} = e_0$ if and only if $\alpha$
is equivalent to the constant path at $b_0$, i.e., if and only if $[\alpha * \bar e_{b_0}] \in H$. This occurs
precisely when $[\alpha] \in H$. ∎


Corollary 82.2. The space B has a universal covering space if and only if B is path 
connected, locally path connected, and semilocally simply connected. 
Exercises


1. Show that a simply connected space is semilocally simply connected. 


2. Let $X$ be the infinite earring in $\mathbb{R}^2$. (See Example 1 of §80.) Let $C(X)$ be the
subspace of $\mathbb{R}^3$ that is the union of all line segments joining points of $X \times 0$ to
the point $p = (0, 0, 1)$. It is called the cone on $X$. Show that $C(X)$ is simply
connected, but is not locally simply connected at the origin.


*Supplementary Exercises: Topological Properties and $\pi_1$


The results of the preceding section tell us that the appropriate hypotheses for classi- 
fying the covering spaces of B are that B is path connected, locally path connected, 
and semilocally simply connected. We now show that they are also the correct hy- 
potheses for studying the relation between various topological properties of B and the 
fundamental group of B. 


1. Let $X$ be a space; let $\mathcal{A}$ be an open covering of $X$. Under what conditions does
there exist an open covering $\mathcal{B}$ of $X$ refining $\mathcal{A}$ such that for each pair $B$, $B'$
of elements of $\mathcal{B}$ that have nonempty intersection, the union $B \cup B'$ lies in an
element of $\mathcal{A}$?

(a) Show that such a covering $\mathcal{B}$ exists if $X$ is metrizable. [Hint: Choose $\epsilon(x)$
so $B(x, 3\epsilon(x))$ lies in an element of $\mathcal{A}$. Let $\mathcal{B}$ consist of the open sets
$B(x, \epsilon(x))$.]

(b) Show that such a covering exists if $X$ is compact Hausdorff. [Hint: Let
$A_1, \ldots, A_n$ be a finite subcollection of $\mathcal{A}$ that covers $X$. Choose an open
covering $C_1, \ldots, C_n$ of $X$ such that $\bar C_i \subset A_i$ for each $i$. For each nonempty
subset $J$ of $\{1, \ldots, n\}$, consider the set

$$B_J = \bigcap_{j \in J} A_j - \bigcup_{j \notin J} \bar C_j.]$$


2. Prove the following:

Theorem. Let $X$ be a space that is path connected, locally path connected,
and semilocally simply connected. If $X$ is regular with a countable basis, then
$\pi_1(X, x_0)$ is countable.

Proof. Let $\mathcal{A}$ be a covering of $X$ by path-connected open sets $A$ such that for
each $A \in \mathcal{A}$ and each $a \in A$, the homomorphism $\pi_1(A, a) \to \pi_1(X, a)$ induced
by inclusion is trivial. Let $\mathcal{B}$ be a countable open covering of $X$ by nonempty
path-connected sets that satisfies the conditions of Exercise 1. Choose a point
$p(B) \in B$ for each $B \in \mathcal{B}$. For each pair $B$, $B'$ of elements of $\mathcal{B}$ for which
$B \cap B' \neq \emptyset$, choose a path $g(B, B')$ in $B \cup B'$ from $p(B)$ to $p(B')$. We call the
path $g(B, B')$ a select path.
Let $B_0$ be a fixed element of $\mathcal{B}$; let $x_0 = p(B_0)$. Show that if $f$ is a loop in $X$
based at $x_0$, then $f$ is path homotopic to a product of select paths, as follows:
(a) Show that there is a subdivision

$$0 = t_0 < \cdots < t_n = 1$$

of $[0, 1]$ such that $f$ maps $[t_{n-1}, t_n]$ into $B_0$, and for each $i = 1, \ldots, n - 1$,
$f$ maps $[t_{i-1}, t_i]$ into an element $B_i$ of $\mathcal{B}$. Set $B_n = B_0$.

(b) Let $f_i$ be the positive linear map of $[0, 1]$ onto $[t_{i-1}, t_i]$ followed by $f$. Let
$g_i = g(B_{i-1}, B_i)$. Choose a path $\alpha_i$ in $B_i$ from $f(t_i)$ to $p(B_i)$; if $i = 0$ or $n$,
let $\alpha_i$ be the constant path at $x_0$. Show that

$$[f_i] * [\alpha_i] = [\alpha_{i-1}] * [g_i].$$

(c) Show that $[f] = [g_1] * \cdots * [g_n]$.

3. Let $p : E \to X$ be a covering map such that $\pi_1(X, x_0)$ is countable. Show
that if $X$ is regular with a countable basis, so is $E$. [Hint: Let $\mathcal{B}$ be a countable
basis for $X$ consisting of path-connected sets. Let $\mathcal{C}$ be the collection of path
components of $p^{-1}(B)$, for $B \in \mathcal{B}$. Compare Exercise 6 of §53.]

4. Prove the following:

Theorem. Let $X$ be a space that is path connected, locally path connected,
and semilocally simply connected. If $X$ is compact Hausdorff, then $\pi_1(X, x_0)$ is
finitely generated, and hence countable.

Proof. Repeat the proof outlined in Exercise 2, choosing $\mathcal{B}$ to be finite. One has
the equation

$$[f] = [g_1] * \cdots * [g_n],$$

as before. Choose, for each $x \in X$, a path $\beta_x$ from $x_0$ to $x$; let $\beta_{x_0}$ be the constant
path. If $g = g(B, B')$, define

$$L(g) = \beta_x * (g * \bar\beta_y),$$

where $x = p(B)$ and $y = p(B')$. Show that

$$[f] = [L(g_1)] * \cdots * [L(g_n)].$$


5. Let $X$ be the infinite earring (see Example 1 of §80). Show that $X$ is a compact
Hausdorff space with a countable basis whose fundamental group is uncountable.
[Hint: Let $r_n : X \to C_n$ be a retraction. Given a sequence $a_1, a_2, \ldots$ of zeros
and ones, show there exists a loop $f$ in $X$ such that, for each $n$, the element
$(r_n)_*[f]$ is trivial if and only if $a_n = 0$.]


Chapter 14 


Applications to Group Theory 


In the preceding chapter, we showed how a problem of topology (classifying all
covering spaces of a space $B$) can be reduced to a problem of algebra (classifying all
subgroups of the fundamental group of $B$). Now we consider the reverse process, that
of reducing a problem of algebra to one of topology. The problem of algebra in
question is that of showing that any subgroup of a free group is itself a free group. While
this statement is certainly believable, it is not one whose proof is obvious. We shall
proceed by applying the theory of covering spaces to certain topological spaces called
linear graphs.


§83 Covering Spaces of a Graph 


We define here the notion of linear graph (introduced earlier in the finite case), and 
prove the basic theorem that any covering space of a linear graph is itself a linear 
graph. 

Recall that an arc $A$ is a space homeomorphic to the unit interval $[0, 1]$. The end
points of $A$ are the points $p$, $q$ corresponding to 0 and 1 under the homeomorphism;
they are the unique points of $A$ such that $A - p$ and $A - q$ are connected. The interior
of an arc $A$ consists of $A$ with its end points deleted.
Definition. A linear graph is a space $X$ that is written as the union of a collection of
subspaces $A_\alpha$, each of which is an arc, such that:

(1) The intersection $A_\alpha \cap A_\beta$ of two arcs is either empty or consists of a single point
that is an end point of each.

(2) The topology of $X$ is coherent with the subspaces $A_\alpha$.

The arcs $A_\alpha$ are called the edges of $X$, and their interiors are called the open edges
of $X$. Their end points are called the vertices of $X$; we denote the set of vertices of $X$
by $X^0$.


If $X$ is a linear graph, and if $C$ is a subset of $X$ that equals a union of edges and
vertices of $X$, then $C$ is closed in $X$. For the intersection of $C$ with $A_\alpha$ is closed in
$A_\alpha$, since it is either empty, or it equals $A_\alpha$, or it equals one or both vertices of $A_\alpha$. It
follows that each edge of $X$ is a closed subset of $X$. It also follows that $X^0$ is a closed
discrete subspace of $X$, since any subset of $X^0$ is closed in $X$.

In the case of a finite graph, considered earlier, we used the Hausdorff condition 
in our definition in place of condition (2); it followed, in that case, that the topology 
of $X$ was coherent with the subspaces $A_\alpha$. In the case of an infinite graph, this would
no longer be true, so we must assume the coherence condition as part of the definition. 
We would assume the Hausdorff condition as well, but it is no longer necessary, for it 
follows from the coherence condition: 


Lemma 83.1. Every linear graph X is Hausdorff; in fact, it is normal. 


Proof. Let $B$ and $C$ be disjoint closed subsets of $X$. Assume, without loss of
generality, that every vertex of $X$ belongs either to $B$ or to $C$. For each $\alpha$, choose disjoint
subsets $U_\alpha$ and $V_\alpha$ of $A_\alpha$ that are open in $A_\alpha$, containing $B \cap A_\alpha$ and $C \cap A_\alpha$,
respectively. Let $U = \bigcup U_\alpha$ and $V = \bigcup V_\alpha$. Then $U$ and $V$ contain $B$ and $C$, respectively.

We show the sets $U$ and $V$ are disjoint. If $x \in U \cap V$, then $x \in U_\alpha \cap V_\beta$ for some
$\alpha \neq \beta$. This fact implies that $A_\alpha$ and $A_\beta$ contain the point $x$, which means that $x$ is a
vertex of $X$. This is impossible, for if $x \in B$, then $x$ lies in no set $V_\beta$, and if $x \in C$,
then $x$ lies in no set $U_\alpha$.

Now we show $U$ and $V$ are open in $X$. To show $U$ is open, we show that $U \cap A_\alpha = U_\alpha$
for each $\alpha$. By definition, $U \cap A_\alpha$ contains $U_\alpha$. If $x$ is a point of $U \cap A_\alpha$ not in $U_\alpha$,
then $x$ belongs to $U_\beta$ for some $\beta \neq \alpha$. Then both $A_\beta$ and $A_\alpha$ contain $x$, so that $x$ must
be a vertex of $X$. This is impossible, for if $x \in B$, then $x \in U_\alpha$ by definition of $U_\alpha$,
and if $x \in C$, then $x$ cannot belong to $U$. ∎


EXAMPLE 1. If X is the wedge of the circles $S_\alpha$, with common point p, then X can be
expressed as a linear graph. We need merely write each $S_\alpha$ as a graph having three edges,
with p as one of its vertices; then X is a union of arcs. To show that the topology of the
wedge X is coherent with the resulting collection of arcs, we note that if $D \cap A_\beta$ is closed
in $A_\beta$ for each arc $A_\beta$, then $D \cap S_\alpha$ is the union of three sets of the form $D \cap A_\beta$ and so is
closed in $S_\alpha$; then D is closed in X by definition. See Figure 83.1.

EXAMPLE 2. Let J be a discrete space, and let $E = [0, 1] \times J$. Then the quotient
space X obtained from E by collapsing the set $\{0\} \times J$ to a point p is a linear graph.

The quotient map $\pi : E \to X$ is a closed map. For if C is closed in E, then $\pi^{-1}\pi(C)$
equals $C \cup (\{0\} \times J)$ if C contains a point of $\{0\} \times J$, and $\pi^{-1}\pi(C)$ equals C otherwise.
In either case, $\pi^{-1}\pi(C)$ is closed in E, so that $\pi(C)$ is closed in X. It follows that $\pi$
maps each space $[0, 1] \times \alpha$ homeomorphically onto its image $A_\alpha$, so that $A_\alpha$ is an arc.
The topology of X is coherent with the subspaces $A_\alpha$ because $\pi$ is a quotient map. See
Figure 83.2.


Figure 83.1 Figure 83.2 


Definition. Let X be a linear graph. Let Y be a subspace of X that is a union of edges 
of X. Then Y is closed in X and is itself a linear graph; we call it a subgraph of X. 


To show that Y is a linear graph, we need to show that the subspace topology on Y
is coherent with the set of edges of Y. If the subset D of Y is closed in the subspace
topology, then D is closed in X, so that $D \cap A_\alpha$ is closed in $A_\alpha$ for each edge of X,
and in particular for each edge of Y. Conversely, suppose $D \cap A_\alpha$ is closed in $A_\alpha$ for
each edge $A_\alpha$ of Y. We must show that $D \cap A_\beta$ is closed in $A_\beta$ for each edge $A_\beta$ of X
that is not contained in Y. But in this case, $D \cap A_\beta$ is either empty or a one-point set!
We conclude that Y has the topology coherent with its set of edges.


Lemma 83.2. Let X be a linear graph. If C is a compact subspace of X, there exists
a finite subgraph Y of X that contains C. If C is connected, Y can be chosen to be 
connected. 


Proof. First, note that C contains only finitely many vertices of X. For $C \cap X^0$ is a
closed discrete subspace of the compact space C; since it has no limit point, it must
be finite. Similarly, there are only finitely many values of $\alpha$ for which C contains an
interior point of the edge $A_\alpha$. For if we choose a point $x_\alpha$ of C interior to $A_\alpha$ for
each index $\alpha$ for which it is possible to do so, we obtain a collection $B = \{x_\alpha\}$ whose
intersection with each edge $A_\beta$ is a one-point set or empty. It follows that every subset
of B is closed in X, so that B is a closed discrete subspace of C and hence finite.

Form Y by choosing, for each vertex x of X belonging to C, an edge of X having x
as a vertex, and adjoining to these edges all edges $A_\alpha$ whose interiors contain points
of C. Then Y is a finite subgraph containing C. Note that if C is connected, then Y is
the union of a collection of arcs each of which intersects C, so that Y is connected. ∎


Lemma 83.3. If X is a linear graph, then X is locally path connected and semilocally 
simply connected. 


Proof. Step 1. We show X is locally path connected. If $x \in X$ and x lies interior to
some edge of X, then within every neighborhood of x is a neighborhood of x homeomorphic
to an open interval of $\mathbb{R}$, which is path connected. On the other hand, if x is a
vertex of X and U is a neighborhood of x, then we can choose, for each edge $A_\alpha$ having
x as an end point, a neighborhood $V_\alpha$ of x in $A_\alpha$ lying in U that is homeomorphic
to the half-open interval $[0, 1)$. Then $\bigcup V_\alpha$ is a neighborhood of x in X lying in U,
and it is a union of path-connected spaces having the point x in common.


Step 2. We show X is semilocally simply connected. Indeed, we show that if
$x \in X$, then x has a neighborhood U such that $\pi_1(U, x)$ is trivial.

If x lies interior to some edge of X, then the interior of this edge is such a neighborhood.
So suppose x is a vertex of X. Let $\overline{\mathrm{St}}\,x$ denote the union of those edges
of X that have x as an end point, and let $\mathrm{St}\,x$ denote the subspace of $\overline{\mathrm{St}}\,x$ obtained by
deleting all vertices other than x. ($\mathrm{St}\,x$ is called the star of x in X.) The set $\mathrm{St}\,x$ is open
in X, since its complement is a union of arcs and vertices. We show that $\pi_1(\mathrm{St}\,x, x)$ is
trivial.

Let f be a loop in $\mathrm{St}\,x$ based at x. Then the image set $f(I)$ is compact, so it lies
in some finite union of arcs of $\overline{\mathrm{St}}\,x$. Any such union is homeomorphic to the union of
a finite set of line segments in the plane having an end point in common. And for any
loop in such a space, the straight-line homotopy will shrink it to the constant loop at x. ∎


Now if x is a vertex of X, it is in fact true that the one-point space $\{x\}$ is a deformation
retract of $\mathrm{St}\,x$. But there is a surprising amount of effort required to show that
the obvious deformation is continuous. One needs the fact that a map

$$F : (\mathrm{St}\,x) \times I \to \mathrm{St}\,x$$

is continuous if its restriction to each subspace $A_\alpha \times I$ is continuous. This result
follows from the pasting lemma in the case where $\mathrm{St}\,x$ is a union of only finitely many
arcs, but the general result requires one to show that the topology of $(\mathrm{St}\,x) \times I$ is
coherent with the subspaces $A_\alpha \times I$. This in turn follows from a basic theorem about
products of quotient maps. (See Exercise 11 of §29.) These considerations do not arise
if one wishes merely to shrink a loop to a point (rather than shrinking the entire space
$\mathrm{St}\,x$), since any loop lies in the union of a finite number of edges, where there is no
problem.

Now we discuss covering spaces of linear graphs. Note that the convention that 
every covering space is assumed to be path connected and locally path connected, 
which we assumed in the last chapter, no longer applies. 

Theorem 83.4. Let $p : E \to X$ be a covering map, where X is a linear graph. If $A_\alpha$
is an edge of X and B is a path component of $p^{-1}(A_\alpha)$, then p maps B homeomorphically
onto $A_\alpha$. Furthermore, the space E is a linear graph, with the path components
of the spaces $p^{-1}(A_\alpha)$ as its edges.


Proof. Step 1. We show that p maps B homeomorphically onto $A_\alpha$. Because the
arc $A_\alpha$ is path connected and locally path connected, Theorems 53.2 and 80.1 tell us
that the map $p_0 : B \to A_\alpha$ obtained by restricting p is a covering map. Because B is
path connected, the lifting correspondence $\phi : \pi_1(A_\alpha, a) \to p_0^{-1}(a)$ is surjective;
because $A_\alpha$ is simply connected, $p_0^{-1}(a)$ consists of a single point. (See Theorem 54.4.)
Hence $p_0$ is a homeomorphism.


Step 2. Because X is the union of the arcs $A_\alpha$, the space E is the union of the
arcs B that are the path components of the spaces $p^{-1}(A_\alpha)$. Let B and $B'$ be path
components of $p^{-1}(A_\alpha)$ and $p^{-1}(A_\beta)$, respectively, with $B \neq B'$. We show B and
$B'$ intersect in at most a common end point. If $A_\alpha$ and $A_\beta$ are equal, then B and $B'$
are disjoint, and if $A_\alpha$ and $A_\beta$ are disjoint, so are B and $B'$. Therefore, if B and $B'$
intersect, $A_\alpha$ and $A_\beta$ must intersect in an end point x of each; then $B \cap B'$ consists of
a single point, which must be an end point of each.


Step 3. We show that E has the topology coherent with the arcs B. This is the
hardest part of the proof. Let W be a subset of E such that $W \cap B$ is open in B, for
each arc B of E. We show that W is open in E.

First, we show that $p(W)$ is open in X. If $A_\alpha$ is an edge of X, then $p(W) \cap A_\alpha$
is the union of the sets $p(W \cap B)$, as B ranges over all path components of $p^{-1}(A_\alpha)$.
Each of these sets $p(W \cap B)$ is open in $A_\alpha$, because p maps B homeomorphically
onto $A_\alpha$; hence their union $p(W) \cap A_\alpha$ is open in $A_\alpha$. Because X has the topology
coherent with the subspaces $A_\alpha$, the set $p(W)$ is open in X.

Second, we prove our result in the special case where the set W is contained in
one of the slices V of $p^{-1}(U)$, where U is an open set of X that is evenly covered
by p. By the result just proved, we know that the set $p(W)$ is open in X. It follows
that $p(W)$ is open in U. Because the map of V onto U obtained by restricting p is a
homeomorphism, W must be open in V, and hence open in E.

Finally, we prove our result in general. Choose a covering $\mathcal{A}$ of X by open sets
U that are evenly covered by p. Then the slices V of the sets $p^{-1}(U)$, for $U \in \mathcal{A}$,
cover E. For each such slice V, let $W_V = W \cap V$. The set $W_V$ has the property that for
each arc B of E, the set $W_V \cap B$ is open in B, for $W_V \cap B = (W \cap B) \cap (V \cap B)$ and
both $W \cap B$ and $V \cap B$ are open in B. The result of the preceding paragraph implies
that $W_V$ is open in E. Since W is the union of the sets $W_V$, it also is open in E. ∎


Exercises 


1. In the proof of normality of a linear graph X, why did we assume that every
vertex of X belongs either to B or to C?

2. The Euler number of a finite linear graph X equals the number of vertices of X
minus the number of edges of X. It is in fact a topological invariant of X, as we
shall see. What is the Euler number of an arc? a circle? a wedge of n circles?
the complete graph on n vertices? If E is an n-fold covering space of X, how are
the Euler numbers of E and X related?


§84 The Fundamental Group of a Graph 


Now we prove the basic theorem that the fundamental group of any linear graph is a 
free group. Henceforth we shall refer to a linear graph simply as a graph. 


Definition. An oriented edge e of a graph X is an edge of X together with an ordering
of its vertices; the first is called the initial vertex, and the second, the final vertex, of e.
An edge path in X is a sequence $e_1, \ldots, e_n$ of oriented edges of X such that the final
vertex of $e_i$ equals the initial vertex of $e_{i+1}$, for $i = 1, \ldots, n - 1$. Such an edge path is
entirely specified by the sequence of vertices $x_0, \ldots, x_n$, where $x_0$ is the initial vertex
of $e_1$ and $x_i$ is the final vertex of $e_i$ for $i = 1, \ldots, n$. It is said to be an edge path from
$x_0$ to $x_n$. It is called a closed edge path if $x_0 = x_n$.


Given an oriented edge e of X, let $f_e$ be the positive linear map of $[0, 1]$ onto e; it
is a path from the initial point of e to the final point of e. Then, corresponding to the
edge path $e_1, \ldots, e_n$ from $x_0$ to $x_n$, one has the actual path

$$f = f_1 * (f_2 * (\cdots * f_n))$$

from $x_0$ to $x_n$, where $f_i = f_{e_i}$; it is uniquely determined by the edge path $e_1, \ldots, e_n$.
We call it the path corresponding to the edge path $e_1, \ldots, e_n$. If the edge path is
closed, then the corresponding path f is a loop.


Lemma 84.1. A graph X is connected if and only if every pair of vertices of X can 
be joined by an edge path in X. 


Proof. Suppose X is connected. Define $x \sim y$ if there is an edge path in X from x
to y. For any edge of X, its end points belong to the same equivalence class; let $Y_x$
denote the union of all edges whose end points are equivalent to x. Then $Y_x$ is a
subgraph of X and hence is closed in X. The subgraphs $Y_x$ form a partition of X into
disjoint closed subspaces; since X is connected, there must be only one such.

Conversely, suppose every pair of vertices of X can be joined by an edge path.
Then they can be joined by an actual path in X. Hence all the vertices of X belong
to the same component of X. Since each edge is connected, it also belongs to this
component. Thus X is connected. ∎
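
For a finite graph, the equivalence relation used in this proof can be explored computationally. The following Python sketch is only an illustration under assumptions not made in the text: the graph is finite, each edge is encoded as a pair of vertex labels, and the function name `edge_path` is hypothetical. It searches breadth-first for an edge path and returns its vertex sequence $x_0, \ldots, x_n$.

```python
from collections import deque


def edge_path(edges, start, goal):
    """Return the vertex sequence of an edge path from start to goal,
    or None if the two vertices are not equivalent under ~."""
    # Each edge {u, v} may be traversed with either orientation.
    adjacent = {}
    for u, v in edges:
        adjacent.setdefault(u, []).append(v)
        adjacent.setdefault(v, []).append(u)

    parent = {start: None}
    queue = deque([start])
    while queue:
        x = queue.popleft()
        if x == goal:
            # Follow parent pointers back to start, then reverse.
            path = []
            while x is not None:
                path.append(x)
                x = parent[x]
            return path[::-1]
        for y in adjacent.get(x, []):
            if y not in parent:
                parent[y] = x
                queue.append(y)
    return None
```

For the edges `[("a", "b"), ("b", "c"), ("d", "e")]`, the vertices `a` and `c` are joined by the edge path with vertex sequence `a, b, c`, while no edge path joins `a` to `d`; by the lemma, this graph is not connected.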

Definition. Let $e_1, \ldots, e_n$ be an edge path in the linear graph X. It can happen that
for some i, the oriented edges $e_i$ and $e_{i+1}$ consist of the same edge of X, but with
opposite orientations. If this situation does not occur, then the edge path is said to be
a reduced edge path.


Note that if this situation does occur, then one can delete $e_i$ and $e_{i+1}$ from the
sequence of oriented edges and still have an edge path remaining (provided the original
sequence consists of at least three edges). This deletion process is called reducing the
edge path. It enables one to show that in any connected graph, every pair of distinct
vertices can be joined by a reduced edge path. See Figure 84.1.
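
The deletion process can be sketched in a few lines of Python. This is an illustrative sketch, not a construction from the text: it assumes the edge path is given by its vertex sequence (which determines the oriented edges, since two distinct edges of a linear graph cannot share both end points), and the names `reduce_edge_path` and `is_reduced` are hypothetical.

```python
def reduce_edge_path(vertex_sequence):
    """Repeatedly delete pairs e_i, e_{i+1} that traverse one edge in
    opposite orientations, working on the vertex sequence of the path."""
    reduced = []
    for x in vertex_sequence:
        # A pattern ..., y, z, y means the edge {y, z} is traversed and
        # then immediately traversed backwards; drop z and keep one y.
        if len(reduced) >= 2 and reduced[-2] == x:
            reduced.pop()
        else:
            reduced.append(x)
    return reduced


def is_reduced(vertex_sequence):
    """True if no two successive oriented edges are opposite orientations."""
    return all(
        vertex_sequence[i] != vertex_sequence[i + 2]
        for i in range(len(vertex_sequence) - 2)
    )
```

Keeping the partial result on a stack means a deletion that exposes a new backtracking pair is handled at once. A path that backtracks completely collapses to a single vertex, which is the degenerate case excluded by the proviso in the text.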


Figure 84.1


Definition. A subgraph T of X is said to be a tree in X if T is connected and T 
contains no closed reduced edge paths. 


A linear graph consisting of a single edge is a tree. The graph in Figure 84.2 is not 
a tree, but deletion of the edge e would make it a tree. The graph in Figure 84.3 is a 
tree; deletion of the edge A would leave a tree remaining. 


Figure 84.2 Figure 84.3 


Lemma 84.2. If T is a tree in X, and if A is an edge of X that intersects T in a single
vertex, then $T \cup A$ is a tree in X. Conversely, if T is a finite tree in X that consists of
more than one edge, then there is a tree $T_0$ in X and an edge A of X that intersects $T_0$
in a single vertex, such that $T = T_0 \cup A$.


Proof. Suppose T is a tree in X and A is an edge that intersects T in a single vertex.
Clearly $T \cup A$ is connected; we show it contains no closed reduced edge paths. Let a
and b be the end points of A, with $\{a\} = T \cap A$. See Figure 84.3. Suppose $x_0, \ldots, x_n = x_0$
is the vertex sequence of a closed reduced edge path in $T \cup A$. If none of the
vertices $x_i$ equals b, then the edge path lies in T, contrary to hypothesis. If $x_i = b$ for
some i with $0 < i < n$, then we must have $x_{i-1} = a$ and $x_{i+1} = a$; hence the edge
path is not reduced, contrary to hypothesis. Finally, if $x_0 = b = x_n$ and $x_i \neq b$ for
$i = 1, \ldots, n - 1$, then $x_1 = a$ and $x_{n-1} = a$, and the vertex sequence $x_1, \ldots, x_{n-1}$
specifies a closed reduced edge path in T, again contrary to hypothesis.

Now let T be a finite tree in X having more than one edge. First, we show that
some vertex b of T belongs to only one edge of T. If this is not the case, we can
construct an edge path in T as follows: Begin with a vertex $x_0$ of T; then choose an
edge $e_1$ of T having $x_0$ as an end point. Orient $e_1$ so $x_0$ is its initial vertex. Let $x_1$ be
the other end point of $e_1$, and let $e_2$ be an edge of T different from $e_1$ having $x_1$ as a
vertex. Orient $e_2$ so $x_1$ is its initial vertex. Similarly continue. No two successive terms
of the sequence $e_1, e_2, \ldots$ are opposite orientations of the same edge of T. Since T is
finite, there must be an index n such that $x_n = x_i$ for some $i < n$. Then the sequence
of vertices $x_i, x_{i+1}, \ldots, x_n$ determines a closed reduced edge path in T, contrary to
hypothesis. See Figure 84.4.

Let b be a vertex of T belonging to only one edge A of T, and let $T_0$ consist of
all edges of T different from A. Then $T = T_0 \cup A$. Because T is connected, $T_0$ must
intersect A in its other vertex a. We show $T_0$ is a tree. Clearly $T_0$ contains no closed
reduced edge paths, because T contains none. Furthermore, $T_0$ is connected. For if
$T_0$ were the union of two disjoint closed sets C and D, the point a would lie in one
of them, say C. Then $C \cup A$ and D would be disjoint closed sets whose union is T,
contrary to the fact that T is connected. ∎
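
The second half of this proof is constructive for finite trees: find a vertex b lying on only one edge A and split A off. A minimal Python sketch follows, assuming the tree is given as a list of vertex pairs; the function name `split_off_leaf_edge` is hypothetical, and the code does not itself check that the remaining edges form a tree (the lemma guarantees this when the input is a finite tree with more than one edge).

```python
from collections import Counter


def split_off_leaf_edge(tree_edges):
    """Return (T0, A, b): A is an edge having a vertex b that belongs to
    no other edge, and T0 is the list of all edges different from A."""
    # Count, for each vertex, the number of edges it belongs to.
    degree = Counter()
    for u, v in tree_edges:
        degree[u] += 1
        degree[v] += 1
    for edge in tree_edges:
        for b in edge:
            if degree[b] == 1:
                rest = [e for e in tree_edges if e != edge]
                return rest, edge, b
    # For a finite graph, the absence of such a vertex forces a closed
    # reduced edge path, so the input was not a tree.
    raise ValueError("every vertex lies on two or more edges")
```

Applying the function repeatedly peels a finite tree down to a single edge, which is the shape of the induction used in the proofs that follow.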


Figure 84.4 


Theorem 84.3. Any tree T is simply connected. 


Proof. We first consider the case where T is a finite tree. If T consists of a single
edge, then T is simply connected. If T has n edges with $n > 1$, there is an edge A
of T such that $T = T_0 \cup A$, where $T_0$ is a tree with $n - 1$ edges and $T_0 \cap A$ is a single
vertex. Then $T_0$ is a deformation retract of T. Since $T_0$ is simply connected by the
induction hypothesis, so is T.

To prove the general case, let f be a loop in T. The image set of f is compact and
connected, so it is contained in a finite connected subgraph Y of T. Now Y contains
no closed reduced edge paths, because T contains none. Thus Y is a tree. Since Y is
finite, it is simply connected. Hence f is path homotopic to a constant in Y. ∎


Definition. A tree T in X is maximal if there is no tree in X that properly contains T. 


Theorem 84.4. Let X be a connected graph. A tree T in X is maximal if and only if 
it contains all the vertices of X. 


Proof. Suppose T is a tree in X that contains all the vertices of X. If Y is a subgraph
of X that properly contains T, we show that Y contains a closed reduced edge path; it
follows that T is maximal. Let A be an edge of Y that is not in T; by hypothesis, the
end points a and b of A belong to T. Since T is connected, we can choose a reduced
edge path $e_1, \ldots, e_n$ in T from a to b. If we follow this sequence by the edge A,
oriented from b to a, we obtain a closed reduced edge path in Y.

Now let T be a tree in X that does not contain all the vertices of X. We show T is
not maximal. Let $x_0$ be a vertex of X not in T. Since X is connected, we may choose
an edge path in X from $x_0$ to a vertex of T, specified by the sequence of vertices $x_0,
\ldots, x_n$. Let i be the smallest index such that $x_i \in T$. Let A be the edge of X with
vertices $x_{i-1}$ and $x_i$. Then $T \cup A$ is a tree in X, by the preceding lemma, and $T \cup A$
properly contains T. ∎
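
For a finite connected graph, the second half of this proof suggests a procedure: keep adjoining an edge that meets the current tree in a single vertex until every vertex is reached. The Python sketch below illustrates this under assumptions of our own: the graph is finite, edges are pairs of vertex labels, the name `maximal_tree` is hypothetical, and the growth is seeded from a single vertex (not itself a subgraph in the sense of the text) for convenience.

```python
def maximal_tree(vertices, edges):
    """Grow a tree one edge at a time; return the list of its edges."""
    vertices = list(vertices)
    reached = {vertices[0]}  # vertices of the tree built so far
    tree_edges = []
    grew = True
    while grew:
        grew = False
        for u, v in edges:
            # Exactly one end point already reached: the edge meets the
            # current tree in a single vertex, so adjoining it keeps a tree.
            if (u in reached) != (v in reached):
                tree_edges.append((u, v))
                reached.update((u, v))
                grew = True
    if len(reached) != len(set(vertices)):
        raise ValueError("graph is not connected")
    return tree_edges
```

On a square with one diagonal (four vertices, five edges) the procedure returns three edges, and the resulting tree contains all four vertices, as the theorem requires of a maximal tree.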


Theorem 84.5. If X is a linear graph, every tree $T_0$ in X is contained in a maximal
tree in X.


Proof. We apply Zorn's lemma to the collection $\mathcal{T}$ of all trees in X that contain $T_0$,
strictly partially ordered by proper inclusion. To show this collection has a maximal
element, we need only prove the following:


If $\mathcal{T}'$ is a subcollection of $\mathcal{T}$ that is simply ordered by proper inclusion,
then the union Y of the elements of $\mathcal{T}'$ is a tree in X.


To begin, we note that since Y is a union of subgraphs of X, it is a subgraph of X.
Second, since Y is a union of connected spaces that contain the connected space $T_0$,
the space Y is connected.

Finally, we suppose that $e_1, \ldots, e_n$ is a closed reduced edge path in Y and derive
a contradiction. For each i, choose an element $T_i$ of $\mathcal{T}'$ that contains $e_i$. Because $\mathcal{T}'$
is simply ordered by proper inclusion, one of the trees $T_1, \ldots, T_n$, say $T_j$, contains
all the others. But then $e_1, \ldots, e_n$ is a closed reduced edge path in $T_j$, contrary to
hypothesis. ∎

Now we compute the fundamental group of a graph. We need the following result.


Lemma 84.6. Suppose $X = U \cup V$, where U and V are open sets of X. Suppose
that $U \cap V$ is the union of two disjoint open path-connected sets A and B, that $\alpha$ is a
path in U from the point a of A to the point b of B, and that $\beta$ is a path in V from b
to a. If U and V are simply connected, then the class $[\alpha * \beta]$ generates $\pi_1(X, a)$.


Proof. The situation is similar to that of Theorem 59.1, except that $U \cap V$ has two
path components instead of one. The proof is also similar.

Let f be a loop in X based at a. Choose a subdivision $0 = a_0 < a_1 < \cdots < a_n = 1$
of $[0, 1]$ such that for each i, $f(a_i) \in U \cap V$ and f maps $[a_{i-1}, a_i]$ into either U
or V. Let $f_i$ be the positive linear map of $[0, 1]$ onto $[a_{i-1}, a_i]$ followed by f; then
$[f] = [f_1] * \cdots * [f_n]$. For $i = 1, \ldots, n - 1$, choose a path $\alpha_i$ in either A or B from
a or b to $f(a_i)$; choose $\alpha_0$ and $\alpha_n$ to be the constant paths at a. Then set

$$g_i = \alpha_{i-1} * (f_i * \bar{\alpha}_i).$$

By direct computation, $[f] = [g_1] * \cdots * [g_n]$. Because $g_i$ is a path in U or in V
with end points in the set $\{a, b\}$, and because U and V are simply connected, $g_i$ is path
homotopic either to a constant or to $\alpha$, $\beta$, $\bar{\alpha}$, or $\bar{\beta}$. It follows that either $[f]$ is trivial,
or it equals a positive power of $[\alpha * \beta]$ or $[\bar{\beta} * \bar{\alpha}]$. Hence $[\alpha * \beta]$ generates the group
$\pi_1(X, a)$. See Figure 84.5. ∎


Figure 84.5 

Theorem 84.7. Let X be a connected graph that is not a tree. Then the fundamental
group of X is a nontrivial free group. Indeed, if T is a maximal tree in X, then the
fundamental group of X has a system of free generators that is in bijective correspondence
with the collection of edges of X that are not in T.


Proof. Let T be a maximal tree in X; it contains all the vertices of X. Let $x_0$ be a
fixed vertex of T. For each vertex x of X, choose a path $\gamma_x$ in T from $x_0$ to x. Then
for each edge A of X that is not in T, define a loop $g_A$ in X as follows. Orient A; let
$f_A$ be the linear path in A from its initial end point x to its final end point y; and set

$$g_A = \gamma_x * (f_A * \bar{\gamma}_y).$$

We show that the classes $[g_A]$ form a system of free generators for $\pi_1(X, x_0)$.
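
For a finite graph, the loops $g_A$ can be written down explicitly as closed edge paths. The following Python sketch is a combinatorial illustration only, under assumptions of our own: the maximal tree is supplied as a list of vertex pairs, each edge not in T is supplied already oriented as a pair (x, y), and the name `generator_loops` is hypothetical. The path $\gamma_x$ is taken to be the tree path found by breadth-first search from the base vertex.

```python
from collections import deque


def generator_loops(tree_edges, other_edges, base):
    """For each oriented edge A = (x, y) not in the tree, return the vertex
    sequence of the closed edge path: tree path from base to x, then A,
    then the tree path from y back to base."""
    adjacent = {}
    for u, v in tree_edges:
        adjacent.setdefault(u, []).append(v)
        adjacent.setdefault(v, []).append(u)

    # Breadth-first search inside the tree records, for every vertex,
    # its predecessor on a path from the base vertex.
    parent = {base: None}
    queue = deque([base])
    while queue:
        x = queue.popleft()
        for y in adjacent.get(x, []):
            if y not in parent:
                parent[y] = x
                queue.append(y)

    def tree_path(x):
        path = []
        while x is not None:
            path.append(x)
            x = parent[x]
        return path[::-1]  # vertex sequence from base to x

    loops = {}
    for x, y in other_edges:
        # gamma_x, then the edge from x to y, then gamma_y reversed.
        loops[(x, y)] = tree_path(x) + tree_path(y)[::-1]
    return loops
```

For the square with one diagonal, with tree edges `ab, bc, cd` and base vertex `a`, the two remaining edges give the closed edge paths `a, b, c, d, a` and `a, c, b, a`, one for each free generator.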


Step 1. We first prove the theorem when the edges of X not in T are finite in
number. We proceed by induction. The induction step is easy, so we consider it first.

Let $A_1, \ldots, A_n$ be the edges of X not in T, where $n > 1$. Orient these edges and
let $g_i$ denote the loop $g_{A_i}$. For each i, choose a point $p_i$ interior to $A_i$. Let

$$U = X - p_2 - \cdots - p_n \quad \text{and} \quad V = X - p_1.$$

Then U and V are open in X, and the space $U \cap V = X - p_1 - \cdots - p_n$ is simply
connected, since it has T as a deformation retract. Therefore, $\pi_1(X, x_0)$ is the free
product of the groups $\pi_1(U, x_0)$ and $\pi_1(V, x_0)$, by Corollary 70.3.

The space U has $T \cup A_1$ as a deformation retract, so $\pi_1(U, x_0)$ is free on the
generator $[g_1]$, as we shall prove in Step 2. The space V has $T \cup A_2 \cup \cdots \cup A_n$ as
a deformation retract, so $\pi_1(V, x_0)$ is free on the generators $[g_2], \ldots, [g_n]$, by the induction
hypothesis. It follows from Theorem 69.2 that $\pi_1(X, x_0)$ is free on the generators $[g_1],
\ldots, [g_n]$. See Figure 84.6.


Figure 84.6 


Step 2. We now consider the case where there is only one edge D of X that is not
in T. This step is more difficult. Orient D. We show $\pi_1(X, x_0)$ is infinite cyclic with
generator $[g_D]$.

Let $a_0$ and $a_1$ be the initial and final points of D, respectively. Let us write D as
the union of three arcs: $D_1$, with end points $a_0$ and a, $D_2$, with end points a and b,
and $D_3$, with end points b and $a_1$. See Figure 84.7. Let $f_1$, $f_2$, and $f_3$ be the linear
paths in D from $a_0$ to a, and a to b, and b to $a_1$, respectively. We apply the preceding
lemma to compute $\pi_1(X, a)$.


Figure 84.7 


Choose a point p interior to the arc $D_2$. Set $U = D - a_0 - a_1$ and $V = X - p$.
Then U and V are open sets in X whose union is X. The space U is simply connected
because it is an open arc. And the space V is simply connected because it has the
tree T as a deformation retract. The space $U \cap V$ equals $U - p$; it has two path
components; let A be the one containing a and let B be the one containing b. Then the
hypotheses of the preceding lemma are satisfied. The path $\alpha = f_2$ is a path in U from
a to b. If we set $\gamma_0 = \gamma_{a_0}$ and $\gamma_1 = \gamma_{a_1}$, then the path $\beta = f_3 * (\bar{\gamma}_1 * (\gamma_0 * f_1))$ is a
path in V from b to a. Therefore, $\pi_1(X, a)$ is generated by the class

$$[\alpha * \beta] = [f_2] * [f_3] * [\bar{\gamma}_1] * [\gamma_0] * [f_1].$$

It follows that $\pi_1(X, x_0)$ is generated by $\hat{\delta}[\alpha * \beta]$, where $\delta$ is the path $\bar{f}_1 * \bar{\gamma}_0$ from a
to $x_0$. We compute this path-homotopy class as follows:

$$\begin{aligned}
\hat{\delta}[\alpha * \beta] &= [\gamma_0 * f_1] * [\alpha * \beta] * [\bar{f}_1 * \bar{\gamma}_0] \\
&= [\gamma_0] * [f_1 * (f_2 * f_3)] * [\bar{\gamma}_1] \\
&= [\gamma_0] * [f_D] * [\bar{\gamma}_1] \\
&= [g_D].
\end{aligned}$$

Therefore, $[g_D]$ generates $\pi_1(X, x_0)$.

It remains to show that the element $[g_D]$ has infinite order, so that $\pi_1(X, x_0)$ is
infinite cyclic. One can apply Theorem 63.1 (which we used in proving the Jordan
curve theorem), which states that $[\alpha * \beta]$ has infinite order in $\pi_1(X, a)$. Alternatively
(and more easily), one can consider the map $\pi : X \to S^1$ that collapses the tree T to
a single point p and maps the open arc Int D homeomorphically onto $S^1 - p$. Then
$\pi \circ \gamma_0$ and $\pi \circ \gamma_1$ are constant paths, so that

$$\pi_*([g_D]) = [\pi \circ f_D].$$

This class generates $\pi_1(S^1, p)$. It follows that $[g_D]$ has infinite order in $\pi_1(X, x_0)$.


Step 3. Now we consider the situation where the collection of edges of X not in T
is infinite. The proof in this case is so similar to the corresponding proof for an infinite
wedge of circles that we omit the details. (See Theorem 71.3.) The crucial facts are
these: Any loop in X based at $x_0$ lies in the space

$$X(\alpha_1, \ldots, \alpha_n) = T \cup A_{\alpha_1} \cup \cdots \cup A_{\alpha_n}$$

for some finite set of indices $\alpha_i$, and any path homotopy between such loops also lies
in such a space. By this means the general case is reduced to the finite case. ∎


Exercises


1. Give an example to show that the second part of Lemma 84.2 need not hold if T 
is infinite. 

2. What is the cardinality of a system of free generators for the fundamental group 
of the complete graph on n vertices? of the utilities graph? (See §64.) 


3. Let X be the wedge of two circles; let $p : E \to X$ be a covering map. The
fundamental group of E maps isomorphically under $p_*$ onto a subgroup H of
the fundamental group of X; the latter is free on two generators $\alpha$ and $\beta$.

(a) For each of the four covering spaces E given in Exercise 2 of §81, determine
the cardinality of a system of free generators for the fundamental group of E.

(b) For each of these covering spaces, find, in terms of $\alpha$ and $\beta$, a system of free
generators for the subgroup H of the fundamental group of X.

We now prove our main theorem, to the effect that a subgroup H of a free group F is
free. The method of proof, remarkably enough, will give us some information about 
the cardinality of a system of free generators for H, when the cardinality of a system 
of free generators for F is known. 

Theorem 85.1. If H is a subgroup of a free group F, then H is free.


Proof. Let $\{\alpha \mid \alpha \in J\}$ be a system of free generators for F. Let X be a wedge of circles
$S_\alpha$, one for each $\alpha \in J$; let $x_0$ be their common point. We can give X the structure
of a linear graph by breaking each circle $S_\alpha$ into three arcs, two of which have $x_0$ as an
end point. The function that assigns to each $\alpha$ a loop generating $\pi_1(S_\alpha, x_0)$ induces
an isomorphism of F with $\pi_1(X, x_0)$. Therefore we may as well assume that F equals
the group $\pi_1(X, x_0)$.

The space X is path connected, locally path connected, and semilocally simply
connected. Therefore Theorem 82.1 applies to show that there exists a path-connected
covering space $p : E \to X$ of X such that, for some point $e_0$ of $p^{-1}(x_0)$,

$$p_*(\pi_1(E, e_0)) = H.$$

Since $p_*$ is a monomorphism, $\pi_1(E, e_0)$ is isomorphic to H.
The space E is a linear graph, by Theorem 83.4. Then Theorem 84.7 implies that
its fundamental group is a free group. ∎


Definition. If X is a finite linear graph, we define the Euler number of X to be the
number of vertices of X minus the number of edges. It is commonly denoted by the
Greek letter chi, as $\chi(X)$.


Lemma 85.2. If X is a finite, connected linear graph, then the cardinality of a system
of free generators for the fundamental group of X is $1 - \chi(X)$.


Proof. Step 1. We first show that for any finite tree T, we have $\chi(T) = 1$. We
proceed by induction on the number n of edges in T. If $n = 1$, then T has one edge
and two vertices, so $\chi(T) = 1$. If $n > 1$, we can write $T = T_0 \cup A$, where $T_0$ is a
tree having $n - 1$ edges, and A is an edge that intersects $T_0$ in a single vertex. We have
$\chi(T_0) = 1$ by the induction hypothesis. The graph T has one more edge and one more
vertex than $T_0$; hence $\chi(T) = \chi(T_0)$.

Step 2. We prove the theorem. Given X, let T be a maximal tree in X. If $X = T$,
we are finished. Otherwise, let $A_1, \ldots, A_n$ be the edges of X that are not in T. Then
the fundamental group of X has a system of n free generators. On the other hand, X
and T have exactly the same vertex set, and X has n more edges than T. Hence,

$$\chi(X) = \chi(T) - n = 1 - n,$$

so that $n = 1 - \chi(X)$. ∎
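
The count in this lemma is easy to check on small examples. The Python sketch below assumes a finite connected graph encoded as a vertex list and a list of vertex pairs; the names `euler_number` and `free_rank` are hypothetical, and connectedness is assumed by the caller and is not verified by the code.

```python
def euler_number(vertices, edges):
    """Number of vertices minus number of edges."""
    return len(set(vertices)) - len(edges)


def free_rank(vertices, edges):
    """Cardinality 1 - chi(X) of a system of free generators,
    valid only when the graph is finite and connected."""
    return 1 - euler_number(vertices, edges)
```

For the wedge of two circles, with each circle broken into three arcs, there are 5 vertices and 6 edges, so the Euler number is -1 and the fundamental group has 2 free generators. A single edge has Euler number 1 and gives 0 generators, in agreement with Step 1.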


Definition. Let H be a subgroup of the group G. If the collection G/H of right
cosets of H in G is finite, its cardinality is called the index of H in G. (The collection
of left cosets of H in G has the same cardinality, of course.)


Theorem 85.3. Let F be a free group with $n + 1$ free generators; let H be a subgroup
of F. If H has index k in F, then H has $kn + 1$ free generators.


Proof. We apply the construction given in the proof of Theorem 85.1. We can assume
that $F = \pi_1(X, x_0)$, where X is a linear graph whose underlying space is a wedge of
$n + 1$ circles. Given H, we choose a path-connected covering space $p : E \to X$ such
that $p_*(\pi_1(E, e_0)) = H$. Now the lifting correspondence

$$\phi : \pi_1(X, x_0)/H \to p^{-1}(x_0)$$

is a bijection. Therefore, E is a k-fold covering of X.

The space E is also a linear graph. Given an edge A of X, the path components
of $p^{-1}(A)$ are edges of E, and each is mapped by p homeomorphically onto A. Thus
E has k times as many edges as X, and k times as many vertices. It follows that
$\chi(E) = k\chi(X)$. Since the fundamental group of X has $n + 1$ free generators, the
preceding lemma tells us that $\chi(X) = -n$. Then the number of free generators of the
fundamental group of E, which is isomorphic to H, is

$$1 - \chi(E) = 1 - k\chi(X) = 1 + kn. \qquad ∎$$
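
The arithmetic of this proof can be traced step by step. The Python sketch below follows the Euler-number computation; the function name `subgroup_rank` and its argument names are hypothetical, and the inputs are assumed to be a finite number of free generators for F and a finite index k.

```python
def subgroup_rank(rank_of_F, index):
    """Number of free generators of a subgroup of finite index k in a
    free group with n + 1 free generators, via chi(E) = k * chi(X)."""
    if rank_of_F < 1 or index < 1:
        raise ValueError("expected rank_of_F >= 1 and index >= 1")
    chi_X = 1 - rank_of_F   # X is a wedge of n + 1 circles, so chi(X) = -n
    chi_E = index * chi_X   # E is a k-fold covering of X
    return 1 - chi_E        # number of free generators of pi_1(E)
```

For instance, a subgroup of index 3 in a free group on 2 generators has $3 \cdot 1 + 1 = 4$ free generators, and every subgroup of finite index in an infinite cyclic group is again infinite cyclic.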


Note that if F is a free group with a finite system of free generators and H is a 
subgroup of F such that F/H is infinite, then nothing can be said about the cardinality 
of a system of free generators for H. It might be finite (for instance, if H is the trivial 
subgroup) or infinite (for instance, if H is the fundamental group of the covering space 
pictured in Example 2 of §81). 


Exercises 


1. Show that the Euler number of a finite linear graph X is a topological invariant
of X. [Hint: First consider the case where X is connected.]


2. Let F be a free group on two free generators $\alpha$ and $\beta$. Let H be the subgroup
generated by $\alpha$. Show that H has infinite index in F.


3. Let $p : \mathbb{R} \to S^1$ be the standard covering map; consider the covering map
$p \times p : \mathbb{R} \times \mathbb{R} \to S^1 \times S^1$. Let $b_0 = (1, 0) \in S^1$; set $X = (b_0 \times S^1) \cup (S^1 \times b_0)$;
let $E = (p \times p)^{-1}(X)$; and let $q : E \to X$ be the covering map obtained by
restricting $p \times p$. The fundamental group of X has free generators $\alpha$ and $\beta$,
where $\alpha$ is represented by a loop in $b_0 \times S^1$ and $\beta$ by a loop in $S^1 \times b_0$. Find a
system of free generators for the subgroup $q_*(\pi_1(E, e_0))$, where $e_0$ is the origin
in $\mathbb{R}^2$.

Baire space (cont.)
open subspace of Baire space, 297
$\mathbb{R}^J$ in box, product, uniform topologies, 300
Ball, unit, 135, 156 (see also $B^n$)
Barber of Seville paradox, 47 
Base point, 331 
Base point choice: 
effect on $h_*$, 335
effect on $\pi_1$, 332
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for a free abelian group, 411 
for a topology, 78, 80 
Bd A, 102 
B(X) (see Stone-Cech compactifica- 
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Betti number, 424 
Bicompactness, 178 
Bijective function, 18 
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Bing metrization theorem, 252 
Bisection theorem, 358, 359 
$B^n$, 156
compactness, 174 
fundamental group, 331 
path connectedness, 156 
Borsuk lemma, 382, 385 
Borsuk-Ulam theorem, 358, 359 
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of a set, 102 
of a surface with boundary, 476 
Bounded above, 27 
Bounded below, 27 
Bounded function, 267 
Bounded metric, 121, 129 
Bounded set, 121 
Box topology, 114 
basis for, 115, 116 
Hausdorff condition, 116 
subspace, 116 
vs. fine topology, 290 
vs. product topology, 115 
vs. uniform topology, 124, 289, 290 
Brouwer fixed-point theorem, 351, 353 


$B^2$, 135 (see also $B^n$)
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comparability, 68 
of a finite set, 39, 42 
greater, 62 
same, 51 
Cartesian product: 
countably infinite, 38 
finite, 13, 37 
general, 113 
Cauchy integral formula, 405 
Cauchy sequence, 264 
C(E, p, B), 487 (see also Group of 
covering transformations) 
Choice axiom (see Axiom of choice) 
Choice function, 59 
Circle, unit (see $S^1$)
Classification: 
of covering spaces, 482 
of covering transformations, 488 
of surfaces, 469 
Clockwise loop, 405 
Closed edge path, 566 
Closed graph theorem, 171 
Closed interval, 84 
Closed map, 137 
Closed ray, 86 
Closed refinement, 245 
Closed topologist’s sine curve, 381 
separates S^2, 381, 393 
Closed set, 93 
in subspace, 94 
vs. limit points, 98 
Closure, 95 
in a cartesian product, 101, 116 
of a connected subspace, 150 
in a subspace, 95 
of a union, 101, 245 
via basis elements, 96 
via nets, 187 
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via sequences, 130, 190 
via limit points, 97 
Coarser topology, 77 
Cofinal, 187 
Coherent topology, 224, 435 
Collection, 12 
Commutator, 422 
Commutator subgroup, 422 
Compact, 164 (see also Compact 
Hausdorff space, Compactness) 
Compact convergence topology, 283 
convergent sequences in, 283 
independence of metric, 286 
vs. compact-open topology, 285 
vs. pointwise convergence topology, 
285 
vs. uniform topology, 285 
Compact Hausdorff space: 
is Baire space, 296 
components equal 
quasicomponents, 236 
metrizability, 218 
normality, 202 
paracompactness, 252 
Compactification, 185, 237 
induced by an imbedding, 238 
one-point, 185 
of (0, 1), 238 
Compactly generated space, 283 
Compactness, 164 (see also Compact 
Hausdorff space) 
of closed intervals in R, 173 
closed set criterion, 169 
of continuous image, 166 
of countable products, 280 
in C(X, R^n), 278, 279 
in C(X, Y), 290, 293 
in finite complement topology, 166 
of finite products, 167 
in Hausdorff metric, 281 
and least upper bound property, 172 
in order topology, 172 
and perfect maps, 172 
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of product space, 234, 236 
in R and R^n, 173 
of subspace, 164 
via nets, 188 
vs. completeness, 276 
vs. limit point compactness, 179 
vs. sequential compactness, 179 
Compact-open topology, 285 
continuity of evaluation map, 286 
vs. compact convergence topology, 
285 
Comparability: 
of cardinalities, 68 
of topologies, 77 
of well-ordered sets, 73 
Comparison test for infinite series, 135 
Complement, 10 
Complete graph, 394 
on five vertices, 308, 397 
Completely normal space, 205 
Completely regular space, 211 (see 
also Complete regularity) 
Complete metric space, 264 (see also 
Completeness) 
Completeness: 
and Baire condition, 296 
of B(X, Y) in uniform metric, 267 
of closed subspace, 264 
of C(X, Y) in uniform metric, 267 
of C(X, Y) in sup metric, 268 
of ℓ^2, 271 
of R^n, 265 
of R^ω, 265 
of R^J in uniform metric, 267 
of Y^X in compact-open topology, 
289 
of Y^J in uniform metric, 267 
vs. compactness, 276 


Complete regularity, 211 
of locally compact Hausdorff space, 
213 
of product space, 211 
of R^J in box topology, 213 
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Complete regularity (cont.) 
of R_ℓ^2, 212 
of S_Ω × S̄_Ω, 212 
of subspace, 211 
of topological group, 213 
vs. normality, 211, 212 
vs. regularity, 214 
Complete set of relations, 425 
Completion, 269 
uniqueness, 271 
Component, 159 
of R^ω in box topology, 162 
of R^ω in uniform topology, 162 
vs. path component, 161 
vs. quasicomponent, 163, 236 
Composite: 
of functions, 17 
of continuous functions, 107 
of covering maps, 341, 485, 487 
of quotient maps, 141 
Conclusion, 7 
Cone, 499 
Conjugacy class, 481 
Conjugate elements, 419 
Conjugate subgroups, 481 
Connected component, 159 
Connectedness, 148 
in box topology, 151 
of closure, 150 
of continuous image, 150 
in finite complement topology, 152 
of finite products, 150 
in a linear continuum, 153 
of long line, 159 
of ordered square, 156 
of a product space, 152 
of R_K, 177 
of R^ω, 151 
of subspace, 148 
of topologist’s sine curve, 156 
vs. path connectedness, 156 
Connected sum: 
of projective planes, 452 
of tori, 451 
Connected space, 148 (see also Con- 
nectedness) 
Constant path, 327 
Contains, 4 
Continuity: 
of algebraic operations in R, 131, 
135 
basis criterion, 103 
and change of range, 108 
and closedness of graph, 171 
closed set criterion, 104 
closure criterion, 104 
of composites, 107 
of constant function, 107 
ε-δ formulation, 129 
of inclusion, 107 
local formulation, 108 
of maps from quotient spaces, 142 
of maps into products, 110, 117 
of metric, 126 
of min{ f, g}, 112 
at a point, 104 
of products of maps, 112 
of restriction, 108 
subbasis criterion, 103 
of uniform limit, 132 
in variables separately, 112 
via nets, 188 
via sequences, 130, 190 
Continuous function, 102 (see also 
Continuity) 
Continuous image: 
of a compact space, 166 
of a connected space, 150 
of a Lindelöf space, 194 
of a space with a countable dense 
subset, 194 
Continuum hypothesis, 62, 205 
Contractible space, 330 
homotopy type, 366 
Contraction, 182, 270 
and fixed points, 182 
vs. shrinking map, 182 


Contrapositive, 8 
Convergent net, 187 
Convergent sequence, 98 
in compact convergence topology, 
283 
in Hausdorff space, 99 
in point-open topology, 282 
in a product space, 118, 265 
Converges uniformly, 131 
Converse, 9 
Convex set: 
in an ordered set, 90, 153 
in R^n, 325 
Coordinate: 
of J-tuple, 113 
of m-tuple, 37 
of ω-tuple, 38 
Coordinate function, 110 
Coset, 146, 330 
Countable basis, 190 (see also Second- 
countability) 
Countable basis at a point, 130, 190 
(see also First-countability) 
Countable compactness, 181 
Countable dense subset, 192 
effect of continuous function, 194 
in R^I, 195 
in R^J, 195 
in R_ℓ, 192 
in subspace, 194 
Countable intersection property, 235 
Countable set, 45 (see also Countabil- 
ity) 
Countability, 45 
of algebraic numbers, 51 
of countable unions, 48 
of finite products, 49 
of rationals, 48 
of subsets, 48 
via injective and surjective maps, 45 
of Z, 44 
of Z_+ × Z_+, 45, 48 
Countably infinite, 44 
Countably locally discrete, 252 
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Countably locally finite, 245 
Counterclockwise loop, 404 
Counterimage, 19 
Covering, 164 
of subspace, 164 
Covering dimension, 305 
Covering map, 336 
composite, 341, 485, 487 
is local homeomorphism, 338 
is open, 336 
products of, 339 
restrictions, 338, 484 
Covering space, 336 
classification, 482 
equivalence, 478 
existence, 495 
of figure eight, 340, 374, 375, 492, 
493 
k-fold, 341 
of linear graph, 505 
of P^2, 372 
regular, 489 
of R^2 − 0, 340 
of S^1, 337, 338, 482 
topological properties, 341, 500 
of torus, 339, 483 
universal, 484 
Covering transformation, 487 
Cube in R”, 314 
Curve, 225 
simple closed, 379 
Curved triangle, 471 
Cutting a region apart, 458 
CW complex, 445 
Cyclic group, 346 


D 
d, 121 
Decomposition space, 139 
Deformation retract, 361 
fundamental group of, 361 
Deformation retraction, 361 
vs. homotopy equivalence, 365, 366 
Degree of a map, 367 
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DeMorgan’s laws, 11 
Dense subset, 191 
Diagonal, 101 
Diameter of a set, 121 
Dictionary order, 26 
Difference of two sets, 10 
Dimension, topological, 305 (see also 
Topological dimension) 
Directed set, 187, 188 
Direct sum, 408, 409 
existence, 409 
extension condition, 408, 410 
uniqueness, 410 
Discrete topology, 77 
metric for, 120 
Disjoint sets, 6 
Distance, 119 
Distance from x to A, 175 
Distributive laws for ∪ and ∩, 11 
Domain, 16 
Double torus, 374, 452 
fundamental group, 374 
Doubly punctured plane, 362 
Dunce cap, 443 
fundamental group, 444 
d(x, A), 175 


E 
Edge: 
of curved triangle, 471 
of a linear graph, 308, 394, 502 
of a polygonal region, 447 
Edge path, 506 
reduced, 507 
Element of a set, 4 
Elementary divisors, 424 
Elementary operations on schemes, 
460 
Empty interior, 295 
Empty set, 6 
End points of arc, 308, 378 
Epimorphism, 330 
ε-ball, 119 
ε-neighborhood of a set, 177 


Equality symbol, 4 
Equicontinuity, 276 
vs. compactness, 278, 279 
vs. total boundedness, 277 
Equivalence class, 22 
Equivalence of compactifications, 237 
Equivalence of covering maps, 478 
existence, 480, 482 
Equivalence of labelling schemes, 461 
Equivalence relation, 22 
Euclidean metric, 122, 128 
Euclidean space, 38 
Euler number, 506, 514, 515 
Evaluation map, 271, 286 
Evenly covered, 336 
Eventually zero, 51 
e_x (constant path), 327 
Expansion lemma, 260 
Extension condition: 
for direct sums, 408, 410 
for free abelian groups, 411 
for free groups, 421 
for free products, 414, 418, 419 
External direct sum, 409 
External free product, 415 
Extreme value theorem, 174 


F 
[f], 324 
Family of sets, 36 
f * g, 326 
Fibonacci numbers, 56 
Field, 31 
Figure-eight space, 340, 362 
covering space, 340, 374, 375, 492, 
493 
fundamental group, 373 
Final point: 
of oriented line segment, 447, 506 
of path, 323 
Finer topology, 77 
basis criterion, 81 
Fine topology, 289 
is Baire, 300 


Finite axiom of choice, 61 
Finite complement topology, 77 
compactness, 166 
connectedness, 152 
Finite dimensional, 305 
Finite intersection property, 169 
Finitely generated group, 421 
Finitely presented group, 425 
as fundamental group, 445 
Finiteness: 
of cartesian products, 43 
of subsets, 43 
of unions, 43 
vs. injective and surjective maps, 43 
Finite presentation, 425 
Finite set, 39 
First category set, 295 
First coordinate, 13 
First-countability, 131, 190 
implies compactly generated, 283 
of metric space, 131 
of product, 191 
of subspace, 191 
of R_ℓ, 192 
First-countable space 
(see First-countability) 
First homology group, 455 
of m-fold projective plane, 456 
of n-fold torus, 456 
First homotopy group (see Fundamen- 
tal group) 
Fixed point, 158, 182 
Fixed-point-free action, 493 
Fixed point theorem: 
for B^n, 353 
for B^2, 351 
for a contraction, 182, 270 
for retract of B^2, 353 
for a shrinking map, 182 
for [0, 1], 158 
Free abelian group, 411 
extension condition, 411 
rank, 411 
subgroup is free abelian, 412 
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Free generators for a group, 421 
Free group, 421 
extension condition, 421 
on a set, 422 
subgroup is free, 514 
Free homotopy of loops, 403 
Free product, 413 
existence, 415 
extension condition, 414, 418, 419 
external, 415 
uniqueness, 418 
Frobenius theorem, 351 
F_σ set, 252 
Function, 16 
Functor, 242 
Functorial properties of h_*, 334 
Fundamental group, 331 
of dunce cap, 444 
of deformation retract, 361 
of double torus, 374, 452 
of figure eight, 362, 373, 434 
of infinite earring, 500 
of linear graph, 511 
of m-fold projective plane, 453 
of n-fold torus, 452 
of a product, 371 
of P^2, 373 
of R^n − 0, 360 
of S^1, 345 
of S^n, 369 
of theta space, 362, 432 
of torus, 371, 442 
of wedge of circles, 434, 436 
of wedge of spaces, 438 
when abelian, 335 
when countable, 499, 500 
when finitely generated, 500 
when uncountable, 500 
Fundamental theorem of algebra, 354 


G 
G_δ set, 194, 249 
Generalized continuum hypothesis, 62 
General lifting lemma, 478 
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General linear group, 146 
General nonseparation theorem, 390 
General position, 308, 310 
General separation theorem, 380, 392 
Generated: 
by elements, 411, 421 
by subgroups, 407, 412 
Generator of cyclic group, 346 
Geometrically independent, 309 
[G, G], 422 
G/H, 146, 331 
regularity, 146 
as topological group, 146 
Graph of a function, 171 
Greater cardinality, 62 
Greatest lower bound, 27 
property, 27 
Group of covering transformations, 
487 
Groupoid properties, 326 


H 
h_*, 333 
dependence on base point, 335 
functorial properties, 334 
Hahn-Mazurkiewicz theorem, 275 
Half-open interval, 84 
Ham sandwich theorem, 359 
Hausdorff condition, 98 
for box topology, 116 
and closedness of diagonal, 100 
and convergent sequences, 99 
for manifold, 227 
for metric space, 129 
for orbit space, 199 
for order topology, 100 
and perfect maps, 199 
for product space, 100, 116, 196 
for quotient space, 142 
for subspace, 100, 196 
for topological group, 146 
and uniqueness of extensions, 112, 
240 
vs. regularity, 195, 197 
vs. T_1 axiom, 99 
Hausdorff maximum principle, 69 
Hausdorff metric, 281 
Hausdorff space, 98 (see also Haus- 
dorff condition) 
Have the same cardinality, 51 
Hilbert cube, 128 
Homeomorphism, 105 
vs. continuous bijective map, 106, 
167 
Homogeneous space, 146 
Homology group, 455 
Homomorphism, 330 
induced by a map, 333 (see also h_*) 
induced by a path, 331 (see also α̂) 
Homotopic maps, 323 
Homotopy, 323 
effect on h_*, 360, 363, 364 
as path in function space, 288 
straight-line, 325 
Homotopy equivalence, 363 
induces isomorphism of π_1, 364 
vs. deformation retraction, 365, 366 
Homotopy extension lemma, 381 
Homotopy inverse, 363 
Homotopy type, 363 
of contractible space, 366 
H_1(X) (see First homology group) 
Hypothesis, 7 


I 
I_o^2 (see Ordered square) 
Identification space, 139 
Identity function, 21 
“If... then,” meaning of, 7 
Image, 16, 19 
Imbedding, 105 
isometric, 133 
Imbedding theorem: 
for a compact manifold, 226, 314 
for a completely regular space, 217 
for a linear graph, 308 
for a manifold, 316 
for a space of dimension m, 311 


Immediate predecessor, 25 
Immediate successor, 25 
Inclusion, 4 
Indexed family of sets, 36 
Indexing function, 36 
Index set, 36 
Index of a subgroup, 514 
Indiscrete topology, 77 
Induction principle, 32 
strong, 33 
transfinite, 67 
Inductive definition, 47 (see also Re- 
cursive definition) 
Inductive dimension, 315 
Inductive set, 32, 67 
Inf A, 27 
Infimum, 27 
Infinite broom, 162 
Infinite earring, 436 
fundamental group, 500 
no universal covering, 487 
Infinite sequence, 38 
Infinite series, 135 
Infinite set, 44 
via injective and bijective functions, 
57 
Initial point: 
of an oriented line segment, 447, 
506 
of a path, 323 
Injective function, 18 
Int A, 95 
Integers, 32 
Interior point: 
of an arc, 379 
of a set, 95 
Intermediate-value theorem, 147, 154 
Intersection, 6, 12, 36 
Interval, 25, 84 
Intervals in R: 
compactness, 173 
connectedness, 154 
topological dimension, 305 
Invariance of domain, 383, 385 
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Inverse function, 18 
Inverse image, 19 
Isolated point, 176 
Isometric imbedding, 133 
in complete metric space, 269, 271 
Isometry, 181 
Isomorphism, 105, 330 


J 
Jordan curve theorem, 390 
Jordan separation theorem, 379 
J-tuple, 113 


K 
Kernel of homomorphism, 330 
k-fold covering, 341 
Klein bottle, 454 
k-plane, 310 
K-topology on R, 82 (see also R_K) 
Kuratowski 14-set problem, 102 
Kuratowski lemma, 72 


L 
Labelling, 447 
Labelling scheme, 449 
(see also Scheme) 
Labels, 447 
Larger topology, 77 
Largest element, 27 
Least normal subgroup, 419 
generators, 420 
Lebesgue number, 175 
Lebesgue number lemma, 175 
Least upper bound, 27 
Least upper bound property, 27 
and compactness, 172, 177 
and local compactness, 183 
for R, 31 
for well-ordered sets, 66 
vs. greatest lower bound property, 29 
Left coset, 146, 330 
Left inverse, 21 
Length of a word, 412 
Lens space, 494 
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Lifting, 342 
Lifting correspondence, 345 
Lifting lemma: 
general, 478 
for path homotopies, 343 
for paths, 342 
Limit point, 97 
vs. T_1 axiom, 99 
Limit point compactness, 178 
vs. compactness, 179 
vs. countable compactness, 181 
Limit of a sequence, 100 
Lindelöf condition, 192 (see also Reg- 
ular Lindelöf space) 
for closed subspace, 194 
effect of continuous function, 194 
for products, 193 
for R_ℓ, 192 
for R_ℓ^2, 193 
for subspace, 193 
Linear continuum, 31, 153 
compact subspaces, 172 
connected subspaces, 153 
long line, 158 
normality, 206 
ordered square, 155 
Linear graph, 308, 394, 502 
covering space of, 505 
fundamental group, 511 
imbedding in R^3, 308 
local path connectedness, 504 
local simple connectedness, 504 
semilocal simple 
connectedness, 504 
topological dimension, 308 
Linear order, 24 
Line with two origins, 227 
Little ell-two topology, 128 
Local compactness, 182 
implies compactly generated, 283 
and least upper bound property, 183 
for orbit space, 199 
and perfect maps, 199 
of products, 186 


of R and R^n and R^ω, 182 
of subspace, 185 
Local connectedness, 161 
of quotient space, 163 
vs. weak local connectedness, 162 
Local homeomorphism, 338 
Locally compact Hausdorff space: 
Baire condition, 299 
complete regularity, 213 
regularity, 205 
Locally discrete, 254 
Locally euclidean, 316 
Locally finite collection, 244 
Locally finite family, 112 
vs. locally finite collection, 245 
Local metrizability, 218, 261 
Local path connectedness, 161 
Local simple connectedness, 495 
vs. simple connectedness, 499 
Logical equivalence, 8 
Logical quantifiers, 9 
Long line, 158, 317 
connectedness, 159 
path connectedness, 159 
Loop, 331 
Lower bound, 27 
Lower limit topology, 82 (see also R_ℓ) 
ℓ^2-topology, 128 


M 
Manifold, 225, 316 
imbedding in R^N, 226, 314, 316 
metrizability, 227 
necessity of Hausdorff condition, 
227 
regularity, 227 
topological dimension, 314, 316 
Mapping, 16 
Maximal element, 70 
Maximal tree, 509 
Maximum principle, 69 
vs. well-ordering theorem, 73 
vs. Zorn’s lemma, 70, 72 


Maximum value theorem: 
of calculus, 147 
general, 174 
Metric, 119 
bounded, 129 
for discrete topology, 120 
for R, 120 
for R^n, 122 
for R^ω, 125 
Metrically equivalent, 270 
Metric space, 120 
Hausdorff condition, 129 
normality, 202 
paracompactness, 257 
subspace, 129 
Metric topology, 119 
Metrizable space, 120 
Metrizability: 
of compact Hausdorff space, 218 
of manifolds, 227 
of ordered square, 194 
of products, 133, 134 
of regular Lindelöf space, 218 
of regular second-countable space, 
215 
of R^J, 133 
of R_ℓ, 194 
of R^n, 123 
of R^ω, 125, 132 
of S_Ω and S̄_Ω, 181 
of Stone-Čech compactification, 242 
m-fold projective plane, 452 
first homology group, 456 
fundamental group, 453 
Minimal uncountable well-ordered set, 
66 (see also S_Ω) 
Möbius band, 450 
Monomorphism, 330 
m-tuple, 37 


N 
Nagata-Smirnov metrization theorem, 
250 
Negation, 9 
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Neighborhood, 96 
Nested sequence of sets, 170 
Net, 187 
n(f, a) (see Winding number) 
n-fold torus, 451 
first homology group, 456 
fundamental group, 452 
Nonseparation theorem: 
arc in S^2, 389 
general, 390 
topologist’s sine curve in S^2, 393 
No-retraction theorem, 348 
Norm, 122 
Normality, 195 
of adjunction space, 224 
of closed subspace, 205 
of coherent topology, 224 
of compact Hausdorff space, 202 
of linear continuum, 206 
of linear graph, 502 
of metric space, 202 
of orbit space, 199 
of product, 198, 203 
of paracompact Hausdorff 
space, 253 
of quotient space, 199, 443 
of regular Lindelöf space, 205 
of regular second-countable space, 
200 
of R_ℓ, 198 
of R^J, 203 
of subspace, 203 
of topological group, 207 
vs. complete regularity, 211, 212 
vs. regularity, 195, 198, 203 
of well-ordered set, 202 
Normalizer, 487 
Normal space, 195 (see also Normal- 
ity) 
Normal subgroup, 330 
Nowhere-differentiable function, 300 
Nulhomotopy lemma, 377 
Nulhomotopic map, 323 
induces trivial homomorphism, 364 
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w-tuple, 38 
One-point compactification, 185 
uniqueness, 183 
One-to-one correspondence, 18 
“Onto” function, 18 
Open covering, 164 
Open interval, 25, 84 
Open map, 92, 137 
Open ray, 86 
Open refinement, 245 
Open set, 76 
relative to subspace, 89 
Operation, binary, 30 
Operation on schemes, 460 
Orbit, 490 
Orbit space, 199, 490 
Order of a covering, 305 
Ordered field, 31 
Ordered pair, 13 
Ordered square, 90 
connectedness, 156 
is linear continuum, 155 
metrizability, 194 
path connectedness, 156 
Order of a group, 346 
Order of a group element, 412 
Order relation, 24 
Order topology, 84 
compact subspaces, 172 
normality, 202, 206 
Hausdorff condition, 100 
subbasis, 86 
vs. subspace topology, 91 
Order type, 25 
Oriented edge of a graph, 506 
Oriented line segment, 447 
“Or,” meaning of, 5 


P 

P(A), 12 
Paracompactness, 253 
of compact Hausdorff space, 252 
of closed subspace, 254 
of metric space, 257 
and perfect maps, 260 
of regular Lindelöf space, 257 
of R^n, 253 
of R^∞ in box topology, 260 
of R”, 257 
of R”, 257 
of S_Ω, 260, 261 
of topological groups, 261 
vs. normality, 253 
Paracompact space, 253 (see also Para- 
compactness) 
Partial order, 71 
axioms, 187 
strict, 68 
Partition of a set, 23 
Partition of unity, 225, 258 
existence, 225, 259 
Pasting lemma, 108 
Pasting edges together, 448 
Pasting regions together, 458 
Path, 155 
corresponding to edge path, 506 
Path component, 160 
vs. component, 161 
Path connectedness, 155 
of B^n, 156 
of long line, 159 
of ordered square, 156 
of R^n − 0, 156 
of S^n, 156 
of topologist’s sine curve, 157 
vs. connectedness, 156 
Path homotopy, 323 
Path-homotopy class, 324 
Path-induced homomorphism, 331 
Peano curve, 271 
Peano space, 275 
Perfectly normal space, 213 
Perfect map, 172, 199 

and compactness, 172 

and paracompactness, 260 
Piecewise linear function, 302 


π_1(X, x_0), 331 (see also Fundamental 
group) 
Plane in R”, 310 
Point-finite collection, 248 
Point-finite family, 227 
Point-open topology, 281 
convergent sequences in, 282 
equals product topology, 282 
vs. compact convergence topology, 
285 
vs. compact-open topology, 285 
Pointwise bounded, 278 
Pointwise convergence topology, 281 
(see also Point-open topology) 
Polygonal region, 447 
Positive integers, 32 
Positive linear map: 
of intervals in R, 328 
of oriented line segments, 447 
Power set, 12 
Precise refinement, 258 
Preimage, 19 
Presentation of a group, 425 
Principle of induction, 32 
transfinite, 67 
Principle of recursive definition, 47, 54 
general, 72 
Product: 
of continuous maps, 112 
of covering maps, 339 
of open maps, 141 
of path-homotopy classes, 326 
of paths, 326 
of quotient maps, 141, 143, 145, 
186, 289 
Product space, 114 (see also Product 
topology) 
fundamental group, 371 
Product topology, 86, 114 
basis, 86, 115, 116 
closures in, 101, 116 
compactness, 167, 234 
complete regularity, 211 
connectedness, 150, 152 
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convergent sequences, 118, 265 
first-countability, 191 
Hausdorff condition, 100, 116, 196 
Lindelöf condition, 193 
local compactness, 186 
metrizability, 133, 134 
normality, 198, 203 
paracompactness, 257 
regularity, 196 
second-countability, 191 
subbasis, 88, 114 
vs. box topology, 115 
vs. point-open topology, 282 
vs. quotient topology, 141, 143, 145, 
186, 289 
vs. subspace topology, 89, 116 
vs. uniform topology, 124 
Projection map, 87, 114 
is open map, 92 
Projective n-space, 373 
Projective plane, 372 (see also P^2) 
Projective-type scheme, 463 
Proper inclusion, 4 
Proper labelling scheme, 463 
Proper subset, 4 
Properly discontinuous, 490 
Prüfer manifold, 317 
P^2, 372 
fundamental group, 373 
is surface, 372 
Punctured euclidean space, 156 (see 
also R^n − 0) 
Punctured plane, 325 (see also R^2 − 0) 


Q 
Q”, 195 
Quantifiers, logical, 9 
Quasicomponent, 163 
vs. component, 163, 236 
Quotient group, 331 
Quotient map, 137 
composites, 141 
products, 141, 143, 145, 186, 289 
restrictions, 137, 138, 140 
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Quotient space, 139 (see also Quotient 
topology) 
Quotient topology, 138 
and continuous functions, 142 
Hausdorff condition, 142, 199 
local compactness, 199 
local connectedness, 163 
normality, 199 
regularity, 199 
second-countability, 199 
T_1 condition, 141 
vs. product topology, 141, 143, 145, 
186, 289 


R 
R (reals), 30 
algebraic properties, 30 
compact subspaces, 173 
connected subspaces, 154 
local compactness, 182 
K-topology, 82 (see also R_K) 
lower limit topology, 82 (see also 
R_ℓ) 
metric for, 120 
order properties, 31 
second-countability, 190 
standard topology, 81 
uncountability, 177 
R_+, 31 
Range of a function, 16 
Rank of a free abelian group, 411 
Rational number, 32 
Ray in ordered set, 85 
Recursive definition, principle, 47, 54 
general principle, 72 
Reduced edge path, 507 
Reduced word, 413 
Refinement, 245, 305 
Regular covering space, 489 
is orbit space, 491 
Regularity, 195 
of G/H, 146 
of locally compact Hausdorff space, 
205 


of manifold, 227 
of orbit space, 199 
and perfect maps, 199 
of products, 196 
of subspaces, 196 
of topological groups, 146 
vs. complete regularity, 214 
vs. Hausdorff condition, 195, 197 
vs. metrizability, 215 
vs. normality, 195, 198, 203 
Regular Lindelöf space: 
metrizability, 218 
normality, 205 
paracompactness, 257 
Regular space, 195 (see also Regular- 
ity) 
Restriction: 
of a covering map, 338, 484 
of a function, 17 
of a quotient map, 137, 138, 140 
of a relation, 28 
Retract, 223, 348 
Retraction, 335, 348 
as quotient map, 144 
Represented by a word, 412 
Reverse of a path, 327 
Relation, 21 
Relation on a free group, 424 
complete set, 425 
ρ, 122, 268 (see also sup metric) 
ρ̄, 124, 266 (see also uniform metric) 
R^I, countable dense subset, 195 
Right coset, 330 
Right inverse, 21 
R^∞, 118 
closure in R^ω, 118, 127 
paracompactness, 260 
R^J in box topology: 
is Baire, 300 
complete regularity, 213 
R^J in product topology: 
is Baire, 300 
countable dense subset, 195 
metrizability, 133 


R^J in product topology (cont.) 
normality, 203 
paracompactness, 257 

R^J in uniform topology, 124 
is Baire, 300 
completeness, 267 

R_K, 82 
connectedness, 178 
separation axioms, 197 
vs. standard topology, 82 

R_ℓ, 82 
countability axioms, 192 
metrizability, 194 
normality, 198 
paracompactness, 257 
vs. standard topology, 82 

R_ℓ^2, 193 
complete regularity, 212 
Lindelöf condition, 193 
paracompactness, 257 
separation axioms, 198 

R^n, 38 
basis, 116 
compact subspaces, 173 
local compactness, 182 
metrics for, 122, 123 
paracompactness, 253 
second-countability, 190 

R^n − 0, 156 
fundamental group, 360 
path connectedness, 156 

R^ω, 38 

R^ω in box topology: 
components, 162 
connectedness, 151 
metrizability, 132 
normality, 205 
paracompactness, 205 

R^ω in product topology: 
completeness, 265 
connectedness, 151 
local compactness, 182 
metrizability, 125 
paracompactness, 257 
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second-countability, 190 
R^ω in uniform topology: 
components, 162 
paracompactness, 257 
second-countability, 190 
R^2, standard topology, 87 
R^2 − 0, 325 
covering space, 340 
fundamental group, 360 
Rule of assignment, 15 
Russell’s paradox, 62 


S 
S_α (section of well-ordered set), 66 
Saturated set, 137 
Scheme, 449 

projective type, 463 

proper, 463 

torus type, 463 
Schroeder-Bernstein theorem, 52 
Schoenflies theorem, 392 
Second category set, 295 
Second coordinate of ordered pair, 13 
Second-countability, 190 
of compact metric space, 194 
of C(I, R), 194 
of orbit space, 199 
and perfect maps, 199 
of products, 191 
of R and R^n and R^ω, 190 
of R_ℓ, 192 
of R^ω in uniform topology, 190 
of subspace, 191 
of topological group, 195 
vs. countable dense subset, 194 
vs. Lindelöf condition, 194 
Second-countable space, 190 (see also 
Second-countability) 
Section: 
of the positive integers, 32 
of a well-ordered set, 66 
Seifert-van Kampen theorem, 426 
classical version, 431 
special case, 369 
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Semilocally simply connected, 494 
Separable, 192 (see also Countable 
dense subset) 
Separates points from closed sets, 218 
Separates a space, 378 
into n components, 378 
Separation, 148 
by continuous functions, 211 
Separation theorem: 
closed topologist’s sine curve in S^2, 
393 
general, 380, 392 
simple closed curve in S^2, 379, 390 
theta space in S^2, 395 
Sequences, 38 
and closure, 130, 190 
and continuity, 130, 190 
Sequence lemma, 130 
Sequential compactness, 179 
vs. compactness, 179 
Shrinking lemma, 227 
general, 258 
Shrinking map, 182 
and fixed points, 182 
vs. contraction, 182 
σ-compact, 289, 316 
σ-locally discrete, 252 
σ-locally finite, 245 
Simple closed curve, 379 
generates π_1 of R^2 − 0, 401 
separates S^2, 379, 390 
winding number, 404, 406 
Simply connected, 333 
S^n, 369 
star-convex set, 334 
tree, 508 
vs. locally simply connected, 499 
Simple loop, 404 
Simple order, 24 
Slice: 
in covering space, 336 
in product space, 167 
Smaller topology, 77 
Smallest element of ordered set, 27 


Smirnov metrization theorem, 261 
S^n (unit sphere), 156 
compactness, 174 
fundamental group, 369 
path connectedness, 156 
simple connectedness, 369 
S_Ω, 66 
compactification, 242 
countable subsets, 66, 74 
existence, 74 
metrizability, 181 
paracompactness, 260, 261 
uniqueness, 73 
S̄_Ω, 66 
metrizability, 181 
S_Ω × S̄_Ω, 203 
complete regularity, 212 
normality, 203 
paracompactness, 254 
S^1, 106 
covering spaces, 337, 482 
fundamental group, 345 
Sphere, unit, 139, 156 (see also S^n) 
Sorgenfrey plane, 193 (see also R_ℓ^2) 
Square metric, 122 (see also ρ) 
Standard bounded metric, 121 
Standard topology: 
on R, 81 
on R?, 87 
Star-convex set, 334 
Stereographic projection, 369 
Stone-Čech compactification, 241 
existence, 239 
extension condition, 240 
metrizability, 242 
of S_Ω, 242 
uniqueness, 240 
of Z_+, 242 
Straight-line homotopy, 325 
Strictly coarser topology, 77 
Strictly finer topology, 77 
Strict partial order, 68 
Strong continuity, 137 
Stronger topology, 78 


Strong induction principle, 33 
S^2, 139 
as quotient space, 136, 139 
Subbasis, 82 
for order topology, 86 
for product topology, 88, 114 
Subgraph, 503 
Subgroup: 
of free abelian group, 412 
of free group, 514 
Subnet, 188 
Subsequence, 179 
Subset, 4 
Subspace topology, 88 
basis, 89 
compactness, 164 
complete regularity, 211 
connectedness, 148 
countable dense subset, 194 
first-countability, 191 
Hausdorff condition, 100, 196 
Lindelöf condition, 193, 194 
local compactness, 185 
in metric space, 129 
normality, 203, 205 
paracompactness, 254 
regularity, 196 
second-countability, 191 
topological dimension, 306 
vs. box topology, 116 
vs. order topology, 91 
vs. product topology, 89, 116 
Sum of groups, 407 
Sup A, 27 
Superset, 233 
Sup metric, 268 
vs. uniform metric, 268 
Support, 225, 257 
Supremum, 27 
Surface, 225, 370 
classification, 457, 469 
Surface with boundary, 476 
Surjective function, 18 
Symmetric neighborhood, 146 


System of free generators, 421 


T 
Theta space, 362, 394 
fundamental group, 432 
separates S^2, 395 
T_i axioms, 211, 213 
Tietze extension theorem, 219 
T_1 axiom, 99 
vs. Hausdorff condition, 99 
vs. limit points, 99 
for quotient space, 141 
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Topological completeness, 270 (see 


also Complete metric space) 
Topological dimension, 305 
of closed subspace, 306 
of closed subspace of R^N, 316 
of compact manifold, 314 
of compact subspace of R, 305 
of compact subspace of R^N, 313 
of compact subspace of R^2, 306 
of linear graph, 308 
of manifold, 316 
of 1-manifold, 308 
of triangular region, 352 
of 2-manifold, 308, 352 
of a union, 307, 308 
of (0, 1], 305 
Topological group, 145 
closedness of A · B, 172, 188 
complete regularity, 213 
covering space of, 483 
Hausdorff condition, 146 
normality, 207 
π_1 is abelian, 335 
paracompactness, 261 
regularity, 146 
second-countability, 195 
Topological imbedding, 105 
Topological property, 105 
Topological space, 76 
Topologist’s sine curve, 157 

components, 160 
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Topologist’s sine curve (cont.) 
connectedness, 156 
does not separate S^2, 393 
path components, 160 
path connectedness, 157 
Topology, 76 
generated by a basis, 78, 80 
generated by a subbasis, 82 
Torsion subgroup, 412, 424 
Torus, 339 
covering space of, 339, 483 
equals doughnut surface, 339 
fundamental group, 371, 442 
as quotient space, 136, 140 
Torus-type scheme, 463 
Totally bounded, 275 
vs. equicontinuity, 277 
Totally disconnected, 152 
Tower, 73 
Transcendental number, 51 
Transfinite induction, 67 
Translation of R^N, 310 
Tree, 507 
fundamental group, 508 
maximal, 509 
Triangle inequality, 119 
Triangulable, 471 
Triangulation, 471 
Trivial homomorphism, 335 
Trivial topology, 77 
Tube, 167 
Tube lemma, 168 
generalized, 171 
Tukey lemma, 72 
2-cell, 441 
2-manifold, 225 
topological dimension, 308, 352 
2-manifold with boundary, 476 
2-sphere, 139 (see also S^2) 
Tychonoff theorem, 234 
for countable products, 280 
for finite products, 167 
via well-ordering theorem, 236 


U 
U(A, ε), 177
Uncountability: 
of P(Z_+), 50
of R, 177 
of transcendental numbers, 51 
of {0, 1}^ω, 49
Uncountable set, 45 
Uncountable well-ordered set, 74 (see
also S_Ω)
Uniform boundedness principle, 299 
Uniform continuity theorem, 147, 176 
Uniform convergence, 131 
on compact sets, 283 
Weierstrass M-test for, 135 
Uniform limit theorem, 132 
converse fails, 134 
partial converse, 171 
Uniformly continuous, 176 
Uniform metric, 124, 266 (see also 
Uniform topology) 
completeness, 267 
vs. sup metric, 268 
Uniform structure, 292 
Uniform topology, 124, 266 
vs. box topology, 124 
vs. compact convergence topology, 
285 
Union, 5, 12, 36 
Unit ball, 135, 331 (see also B^2, B^n)
Unit circle, 106 (see also S^1)
Unit sphere, 156 (see also S^n)
Universal covering space, 484 
existence, 498 
Universal extension property, 223 
Upper bound, 27, 70 
Urysohn lemma, 207 
strong form, 213 
Urysohn metrization theorem, 215 
Utilities graph, 308, 394 
nonembeddability, 396


V
Vacuously true, 7
Value of a function, 16
Vanish at infinity, 280 
Vanish precisely on A, 213 
Vector field, 350 
Vertex: 
of a curved triangle, 471 
of a linear graph, 308, 394, 502 
of a polygonal region, 447 


W
Weaker topology, 78 
Weak local connectedness, 162 
vs. connectedness, 162 
Wedge of circles, 434, 435 
existence, 437 
fundamental group, 434, 436 
Wedge of spaces, 438 
Weierstrass M-test, 135 
Well-ordered set, 63 
compact subspaces, 172 
dictionary order, 64 
finite, 64 
normality, 202 
subsets well-ordered, 63 
uncountable, 66 
Z_+, 32
Z_+ × Z_+, 63
Well-ordering theorem, 65
applied, 236, 246
and axiom of choice, 67, 73
and maximum principle, 70, 73
Winding number, 398, 403
as an integral, 405
of simple closed curve, 404, 406
Word, 412, 415
reduced, 413


X
X^J, 113
X^n, 38
X^ω, 38
[X, Y], 330


Z 
Z, 32 
Z_+, 32
not finite, 42 
well-ordered, 32 
Zermelo, 65 
Zorn’s lemma, 70 
applied, 72, 233, 236, 509 
vs. maximum principle, 72 


